(19) 



J 



(12) 



(43) Date of publication: 

07.03.2001 Bulletin 2001/10 

(21) Application number: 00119003.2 

(22) Date of filing: 01 .09.2000 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets (11) EP 1 081 967 A2 

EUROPEAN PATENT APPLICATION 

(51) mtci. 7 : H04N 9/82 



(84) 


Designated Contracting States: 


(72) Inventors: 




AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 


• Fukuda, Hideki 




MCNLPTSE 


Nara-shi, Nara 631-0843 (JP) 




Designated Extension States: 


• Kondo, Satoshi 




ALLTLVMKRO SI 


Yawata-shi, Kyoto 614-8361 (JP) 


(30) 


Priority: 02.09.1999 JP 24834699 


(74) Representative: 






Griinecker, Kinkeldey, 


(71) 


Applicant: 


Stockmair & Schwanhausser 




Matsushita Electric Industrial Co., Ltd. 


Anwaltssozietat 




Kadoma-shi, Osaka 571-8501 (JP) 


Maximilianstrasse 58 






80538 Munchen (DE) 



CM 
< 

CO 

co 



(54) Recording apparatus and coding apparatus 

(57) A recording apparatus comprises a detector 
107 for detecting a video attribute, a detector 109 for 
detecting an audio attribute, and an address information 
generator 108 for generating specific address informa- 
tion Isa which indicates a data write address at the time 
when the video attribute or the audio attribute changes, 
on the basis of the outputs from the detectors 107 and 
109. The specific address information Isa is written in a 

Fig.1 



recording area of an optical disk 1 06, which area corre- 
sponds to VMGI 10b1 constituting management infor- 
mation (VMG information) 10a for each VOB. Therefore, 
when playing an audio video stream recorded on the 
optical disk 106, quick access can be made to the 
recording position in the stream where the attribute 
change occurs. 
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Description 

FIELD OF THE INVENTION 

[0001] The present invention relates to a recording 
apparatus and a coding apparatus and, more particu- 
larly, to an apparatus for recording an audio video signal 
on a recording medium, and an apparatus for coding a 
video signal. 

BACKGROUND OF THE INVENTION 

[0002] On a playback-only DVD (Digital Video 
Disk), coded data which is obtained by performing com- 
pressive coding on an audio video signal corresponding 
to a specific program or the like, is recorded. This audio 
video signal includes an audio signal and a video signal, 
and coded audio data obtained by coding the audio sig- 
nal and coded video data obtained by coding the video 
signal are recorded on the DVD as the above-described 
coded data. Further, the coded audio data and the 
coded video data are respectively packed. That is, 
these coded data are divided into plural pieces of data 
corresponding to first data units each having a predeter- 
mined data size (e.g., 2048 bytes) . 
[0003] In the following description, the coded audio 
data corresponding to the first data units are referred to 
as audio packs, and the coded video data correspond- 
ing to the first data units are referred to as video packs. 
[0004] These audio packs and video packs are mul- 
tiplexed and recorded on the DVD. 
[0005] Furthermore, the coded data recorded on 
the DVD are divided into plural pieces of data corre- 
sponding to second data units each including plural 
pieces of the first data units, and the coded data are 
managed in the second data units. The coded data cor- 
responding to the second data units are referred to as 
video objects (VOB). 

[0006] For example, the coded data corresponding 
to one program is composed of at least one VOB. In the 
video standard relating to DVDROM, a group compris- 
ing at least one VOB is called a video object set 
(VOBS), and it is recorded on the DVD as one title. 
[0007] Furthermore, on the DVD, together with the 
coded data corresponding to each title (program), the 
following information is recorded as information for 
managing the title (program) : information indicating the 
position in the recording area of the DVD where the 
coded data corresponding to this title is recorded 
(recording position information), video attribute informa- 
tion corresponding to the coded data, and audio 
attribute information corresponding to the coded data. 
Each information is recorded in VOB units, as men- 
tioned above. 

[0008] The recording position information is various 
kinds of address information in the recording area of the 
DVD, for example, a header address and an end 
address of the area where the coded data of the VOBS 



corresponding to the title is recorded, or an address 
indicating the search point which has previously set by 
the user. 

[0009] The video attribute information relates to the 
5 compressive coding mode, TV system, aspect ratio, dis- 
play mode, etc. 

[0010] There are two types of compressive coding 
modes for DVD: a mode based on the MPEG-1 coding, 
and a mode based on the MPEG-2 coding. The com- 
10 pressive coding mode information indicates that the 
coded data of each VOB corresponds to either the 
former mode or the latter mode. 

[0011] There are two types of TV systems: a sys- 
tem corresponding to the NTSC system (number of 

15 lines: 525, frame frequency: 59.97Hz), arid a system 
corresponding to the PAL system (number of lines: 625, 
frame frequency: 50Hz). The TV system information 
indicates that the coded data of each VOB corresponds 
to either the former system or the latter system. 

20 [001 2] The aspect ratio is the ratio of the image size 
in the horizontal direction to the image size in the verti- 
cal direction, and two ratios, 4:3 and 16:9, are practi- 
cally adopted. The aspect ratio information indicates 
that the coded data of each VOB corresponds to either 

25 4:3 or 16:9. 

[0013] Further, the display mode is a method of 
image display based on the video signal obtained from 
the coded data. For example, it is the pan & scan display 
mode or the letter box display mode. The display mode 

30 information indicates a display mode by which the video 
signal obtained from the coded data of each VOB is to 
be displayed. 

[0014] In the pan & scan display mode, a wide 
image having an aspect ratio of 16:9 is displayed on a 

35 standard screen having an aspect ratio of 4:3, by 
removing left and right sides of the wide image. In the 
letter box display mode, a wide image having an aspect 
ratio of 1 6:9 is displayed on a standard screen having 
an aspect ratio of 4:3, by adding regions of a predeter- 

40 mined color to top and bottom sides of the wide image. 
[0015] Meanwhile, there is the MPEG coding as an 
international standard of a compressive coding method 
for a video signal (hereinafter also referred to as image 
data) . In the MPEG coding, the process of coding the 

45 image data is adaptive ly switched between the intra- 
frame coding in which the image data is coded using the 
correlation of pixel values in one frame, and the inter- 
frame coding in which the image data is coded using the 
correlation of pixel values between frames. In the 

so MPEG coding, the coded data corresponding to contin- 
uous plural frames are regarded as one unit, and the 
image comprising the continuous plural frames is called 
a group of pictures (GOP) . 

[0016] To be specific, in the MPEG coding, the 
55 image data of at least one frame among the plural 
frames constituting the GOP is subjected to the intra- 
frame coding while the image data of the remaining 
frames are subjected to the inter-frame coding. 
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[0017] There are two types of inter-frame coding: 
forward direction inter-frame predictive coding, and bi- 
directional inter-frame predictive coding. A frame to be 
subjected to the forward inter-frame predictive coding is 
called a P frame, and a frame to be subjected to the bi- 5 
directional inter-frame predictive coding is called a B 
frame. The image data of the P frame is subjected to the 
predictive coding with reference to the image data of a 
frame (reference frame) positioned before the P frame. 
The image data of the B frame is subjected to the pre- 10 
dictive coding with reference to the image data of two 
frames (reference frames) which are positioned close to 
and before and after the B frame. Usually, when coding 
a P frame, an I frame close to the P frame is used as a 
reference frame. When coding a B frame, an I frame 75 
and a P frame (or two P frames) which are close to the 
B frame are used as reference frames. 
[0018] Figure 17 is a diagram for explaining an 
example of the structure of the GOP, wherein continu- 
ous plural frames F(k-5)~ F(k+12) are associated 20 
with coded data D(k-5)~D(k+12) corresponding to the 
respective frames. In figure 17, k is an arbitrary integer. 
[0019] One GOP is composed of twelve frames 
from the B frame F(k-2) to the P frame F(k+9) . For 
example, the P frame F(k+3) is subjected to the inter- 25 
frame predictive coding with reference to the I frame 
F(k) . Further, the P frame F(k+6) is subjected to the 
inter-frame predictive coding with reference to the P 
frame F(k+3) . Further, the B frames F(k-M) and F(k+2) 
are subjected to the inter-frame predictive coding with 30 
reference to the I frame F(k) and the P frame F(k+3). 
[0020] The coded data corresponding to the 
respective frames, which are obtained in the above- 
described coding process, are subjected to the process 
of changing the arrangement of the coded data from the 35 
arrangement according to the order of displaying the 
images of the respective frames to the arrangement 
according to the order of decoding the respective 
frames, thereby reducing the capacity of a memory 
used for decoding (rearrangement process). To be spe- 40 
cific, as shown in figure 1 7, in the arrangement obtained 
by subjecting the coded data corresponding to the GOP 
to the above-mentioned rearrangement process, the 
coded data D(k) of the I frame F(k) is positioned at the 
head of the GOP, followed by the coded data D(k-2) of 45 
the B frame F(k-2), the coded data D(k-1 ) of the B frame 
F(k-1), and the coded data D(k+3) of the P frame 
F(k+3). 

[0021] Then, the coded data corresponding to the 
GOP is recorded on a recording medium or transmitted so 
through a transmission medium, according to the order 
after the rearrangement process. 

[0022] By the way, header information of a video 
stream (coded data obtained by coding a video signal) 
based on the MPEG standard includes, as video resolu- 55 
tion information, information relating to the horizontal 
and vertical image sizes, frame frequency, and aspect 
ratio. Further, the header information includes informa- 



tion for recognizing that the video stream corresponds 
to either an interlace signal or a progressive signal. 
[0023] In the standard relating to DVD, coded data 
which corresponds to at least one GOP and is equiva- 
lent to a display time longer than 0.4 sec. and shorter 
than 1 .0 sec. is defined as a video object unit (VOBU) 
(third data unit), and a VOB is composed of a plurality of 
VOBU. 

[0024] Each VOBU includes a plurality of packs 
(first data units), and the head position of the VOBU 
matches the head position of the packs. Further, at the 
head of the VOBU, a pack called a navigation pack, 
including information such as playback control informa- 
tion (PCI) and data search information (DSI), is placed. 
[0025] In the field of television broadcasting, CS 
(Communication Satellite) broadcasting takes the lead 
in digitization, and digital broadcasting of a high-vision 
TV signal will be started subsequent to digital broad- 
casting of a standard TV signal. Accordingly, it is sup- 
posed that a standard TV signal and a high-vision TV 
signal will coexist in one broadcast sequence, or an 
interlace signal and a progressive signal will coexist in 
one broadcast sequence. In this case, the video resolu- 
tions of the TV signals which are broadcast in the same 
broadcast sequence, will change at a change of pro- 
gram. 

[0026] In such digital TV broadcasting, a video 
stream and an audio stream are multiplexed according 
to the MPEG standard to be transmitted as a transport 
stream. 

[0027] On the other hand, when recording the video 
stream and the audio stream on a DVD, the transport 
stream including the video stream and the audio stream 
is converted to a program stream to make the DVD have 
trick play functions, and this program stream is recorded 
on the DVD. Accordingly, when the audio video stream 
obtained from the received digital TV broadcast signal is 
recorded on the DVD, the audio video stream must be 
converted from the transport stream to the program 
stream (TS/PS conversion), and techniques relating to 
such stream conversion have already been developed. 
For example, Japanese Published Patent Application 
No. Hei. 11-45512 (HITACHI) discloses a technique of 
TS/PS conversion, i.e., a technique for converting a 
transport stream to a program stream. 
[0028] Hereinafter, a description will be given of the 
standard for recording the program stream on a record- 
ing medium such as an optical disk. 
[0029] Figure 1 8 is a diagram for explaining the for- 
mat of recorded data 10 based on the recording stand- 
ard, and illustrates the specific contents of video 
attribute information (V ATR) 10d1 . 
[0030] The recorded data 1 0 is data recorded on a 
recording medium by a recorder based on the above- 
described recording standard, and this recorded data 
10 is composed of a video manager (VMG) 10a, and 
three video objects: a video object (VOB(1)) 10a1, a 
video object (VOB(2)) 10a2, and a video object 
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(VOB(3)) I0a3. Each of the VOB(1) I0a1 ~ VOB(3) 
1 0a3 includes an audio video stream, and the VMG 1 0a 
includes management information for each VOB. 
[0031] The recorded data 10 corresponds to one 
TV broadcast program. Further, the recorded data 10 is 
divided into three VOBs according to the above-men- 
tioned recording standard because the user performed 
two times of pause operations while recording the audio 
video stream of this program. That is, the boundary 
between the VOB(1) 10a1 and the VOB(2) 10a2 corre- 
sponds to the first pause position, and the boundary 
between the VOB(2) 10a2 and the VOB(3) 10a3 corre- 
sponds to the second pause position. In other words, in 
the recording process based on the DVD recording 
standard, when recording of the audio video stream is 
paused, the streams before and after the pause position 
are recorded as different VOBs on the recording 
medium. 

[0032] As described above, the VMG 1 0a is man- 
agement information for each VOB recorded, and the 
VMG 10a is composed of video manager information 
(VMGI) 1 0bi and an audio video file information table 
(AVFIT) 10b2. The VMGI 10b1 includes, as search 
information, information relating to the time when each 
VOB was recorded on the recording medium (recording 
time), and the address of the recording area in the 
recording medium corresponding to each VOB (record- 
ing address) . 

[0033] Further, the AVFIT 10b2 includes audio 
video file table information (AVFITI) 10c, and plural 
pieces of video object stream information (VOB_STI) as 
many as the number of the recorded VOBs, i.e., 
VOB_STI(1) 10c1, VOB_STI(2) 10c2, and VOB_STI(3) 
10c3. The AVFITI 10c includes information about the 
number of the recorded VOBs, and the like. Each of the 
VOB_STI 10c1~10c3 includes the attribute information 
of the corresponding VOB. For example, the 
VOB_STI(1) 10c1 is composed of video attribute infor- 
mation (V_ATR) 10d1 and audio attribute information 
(A_ATR) 1 0d2. 

[0034] Hereinafter, the video attribute information 
(V_ATR) 10d1 will be described in detail. 
[0035] The V_ATR 10d1 includes compression 
mode information 10e1, horizontal video resolution 
(H_video resolution) information 10e2, vertical video 
resolution (V_video resolution) information 10e3, frame 
frequency information 10e4, TV system information 
10e5, and, aspect ratio information 10e6. 
[0036] The compression mode information 10e1 is 
information for recognizing that the video stream of 
each VOB is based on either the MPEG-1 coding or the 
MPEG-2 coding. 

[0037] In the horizontal video resolution information 
10e2, information for identifying the frame size in the 
horizontal direction corresponding to each VOB is 
described. To be specific, as the number of pixels in the 
horizontal direction, any of the following values is 
described: 352, 480, 544, 704, 720, 1440, 1920. 



[0038] In the vertical video resolution information 
10e3, information for identifying the frame size in the 
vertical direction corresponding to each VOB is 
described. To be specific, as the number of scanning 
5 lines, any of the following values is described: 240, 480, 
576, 720, 1080. 

[0039] The frame frequency information 10e4 is 
information for identifying the frame frequency corre- 
sponding to each VOB. For example, it shows any of the 
w following frequencies: 24Hz, 29.97Hz, 30Hz, 25Hz, 
50Hz, 60Hz. 

[0040] The TV system information 10e5 is informa- 
tion for identifying that the video signal corresponding to 
each VOB is either an interlace signal or a progressive 
15 signal. 

[0041 ] The aspect ratio information 1 0e5 is informa- 
tion for identifying the aspect ratio of the video signal 
corresponding to each VOB. For example, it shows the 
value of the aspect ratio (4:3 or 1 6:9), or the type of the 
20 letter box. 

[0042] While the V_ATR 10d1 shown in figure 18 
includes the compression mode information 10e1, the 
horizontal video resolution information 1 0e2, the vertical 
video resolution information 10e3, the frame frequency 
25 information 1 0e4, the TV system information 1 0e5, and 
the aspect ratio information 10e6, the V_ATR 10d1 may 
include caption data information (Line21_ switch) 10e7 
in addition to the information 10e1~10e6 as shown in 
figure 1 9. The caption data information 1 0e7 is informa- 
nt? tion for identifying whether each of the video signals in 
the first and second fields includes Line21 data or not. 
The Line21 data is closed caption data which is super- 
posed on a portion of the video signal corresponding to 
the 21st line. 

35 [0043] Next, the audio attribute information 
(A_ATR) 1 0d2 will be described in detail. 
[0044] Figure 20 is a diagram for explaining the for- 
mat of the above-described recorded data, and illus- 
trates the specific contents of the audio attribute 

40 information (A_ATR) 10d2. 

[0045] The A_ATR 1 0d2 includes, as information for 
identifying the attribute of the audio signal correspond- 
ing to each VOB, coding mode information 1 0f 1 , quanti- 
zation information 10f2, dynamic range control (DRC) 

45 information 10f3, sampling frequency (fs) information 
10f4, number-of-audio-channels information 10f5, and 
audio bit rate information 10f6. 

[0046] The coding mode information 1 0f 1 is infor- 
mation for identifying the type of the audio stream corre- 
50 sponding to each VOB. For example, it shows that the 
audio stream corresponds to any of the following 
modes: Dolby AC3, MPEG-1, MPEG-2, and Linear 
PCM (Pulse Code Modulation) . 

[0047] The quantization information 1 0f 2 is informa- 
55 tion for identifying the number of quantized bits (1 6 bits, 
20 bits, 24 bits, etc.) in the case where the audio stream 
corresponding to each VOB is subjected to Linear PCM. 
[0048] The dynamic range control information 10f3 
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is information for identifying whether or not the audio 
stream corresponding to each VOB includes dynamic 
range control data in the MPEG-1 or MPEG-2 coding. 
[0049] The sampling frequency information 10f4 is 
information for identifying the sampling frequency 
(48kHz, 96kHz, etc.) of the audio stream corresponding 
to each VOB. 

[0050] The number-of-audio-channels information 
10f5 is information for identifying the number of chan- 
nels (1ch(mono), 2ch(stereo), 2ch(dual mono), 3ch, 
4ch, 5ch, 6ch, 7ch, 8ch, etc.) of the played audio signal 
obtained from the audio stream corresponding to each 
VOB. 

[0051] The audio bit rate information 10f6 is infor- 
mation for identifying the bit rate (64kbps, 89kbps, 
112kbps, 128kbps, 160kbps, 192kbps, 224kbps, 
256kbps, 320kbps, 384kbps, 448kbps, 768kbps, 
1536kbps, etc.) of the audio stream corresponding to 
each VOB. 

[0052] It is possible to recognize the recording time 
of the audio video stream corresponding to each VOB, 
the recording address thereof, and the attribute informa- 
tion of each VOB, by reading the VMG 10a from the 
data 1 0 recorded on the recording medium (optical 
disk). 

[0053] Next, the structure of the VOB in the 
recorded data 1 0 will be described with reference to fig- 
ures 21(a); 21(b), 22(a) and 22(b). Figures 21(a) and 
22(a) show the detailed structure of the VOB(1 ) 1 0a1 in 
the recorded data 10. 

[0054] The VOB(1) 10a1 is composed of plural 
video object units, i.e., VOBU(1) 10g1, VOBU(2) 10g2, 
VOBU(3) 10g3, VOBU(4) 10g4, VOBU (n) lOgn. 
[0055] One VOBU includes an audio video stream 
which corresponds to at least one GOP and is equiva- 
lent to a display time of 0.4-1 .0 sec. For example, the 
VOBU(1) 10g1 is composed of plural video packs 

(V_PCK) 10h(1), 10h(2), 10h(3) 10h(r), and plural 

audio packs (A_PCK) 10i(1), 10i(2) , 10i(s). Each of 
the video packets and audio packets has a predeter- 
mined data size, and the data size is 2048 bytes in this 
case. 

[0056] Figure 21(b) shows the video packs 
10h(1)~10h(8) associated with the streams of the 
respective frames constituting the GOP. 
[0057] The VOBU(1) 10g1 includes a video stream 
corresponding to one GOP. To be specific, the video 
stream included in the VOBU(1) 10g1 is composed of I 
frame coded data Dv1, B frame coded data Dv2 and 
Dv3, P frame coded data Dv4, B frame coded data Dv5 
and Dv6, P frame coded data Dv7, B frame coded data 
Dv8 and Dv9, and padding data Dpud. 
[0058] Since each VOBU is composed of the video 
packs and audio packs each having 2048 bytes, the 
data size of the VOBU should be an integer multiple of 
2048 bytes. So, the padding data Dpud is added to the 
video stream corresponding to one GOP to make the 
data size of the video stream included in the VOBU 



equal to an integer multiple of 2048 bytes. 
[0059] Figure 22(b) shows the audio packs 
10i(1)~10i(4) constituting the VOBU(1) 10g1, associ- 
ated with the streams of the respective audio frames. 

5 [0060] The VOBU(1) 10g1 includes an audio 
stream corresponding to one GOP. To be specific, the 
audio stream included in the VOBU(1) 10g1 is com- 
posed of coded data of the audio frames Da1 Da8 and 
the padding data Dpud1~Dpud4 corresponding to the 

10 respective audio packs (A_PCK) 1 0i(1)~1 0i(4) . That is, 
the A_ PCK 1 0i(1 ) includes the coded data of the audio 
frames Da1 and Da2 and the padding data Dpudl , the 
A_PCK 10i(2) includes the coded data of the audio 
frames Da3 and Da4 and the padding data Dpud2, the 

15 A_PCK 10i(3) includes the coded data of the audio 
frames Da5 and Da6 and the padding data Dpud3, and 
the A_PCK 10i(4) includes the coded data of the audio 
frames Da7 and Da8 and the padding data Dpud4. 
[0061 ] As described above, the audio stream of one 

20 A_PCK 10i(s) is composed of the coded data of two 
audio frames and the padding data to make the data 
size of the audio stream included in the VOBU equal to 
an integer multiple of 2048 bytes. 

[0062] While in figures 21(b) and 22(b) only the 
25 structures of the VOB 10a1 and VOBU(1) 10g1 are 
described, the VOB 10a2 and VOB 10a3 and the 
VOBU(2) 10g2 ~ VOBU(n) 10gn also have the same 
structures as those of the VOB 10a1 and VOBU(1) 
1 0g1 , respectively. 
30 [0063] As described above, since the VOBU has 
the data structure which is divided into data units each 
having a predetermined data size (2048 bytes), address 
management for the VOBU is simplified, and access to 
data in VOBU units on the recording medium is facili- 
35 tated. 

[0064] In the conventional recording apparatus, 
however, the audio video stream is usually recorded as 
one VOB, and management of its attribute is performed 
for the whole stream in the lump. This results in various 

40 drawbacks as follows. 

[0065] When the conventional recording apparatus 
records the audio video stream, the stream inputted to 
the recording apparatus from when the recording is 
started to when the recording is ended, is recorded as 

45 one VOB on the recording medium. However, when the 
recording operation is paused, the stream inputted 
before the pause and the stream inputted after the 
pause are recorded as different VOBs on the recording 
medium. Further, there are other causes by which the 

so audio video stream is recorded as plural VOBs. 

[0066] In other words, in the conventional recording 
apparatus, the audio video stream corresponding to one 
program is usually recorded as one VOB on the record- 
ing medium. When the recording of the stream is 

55 paused to prevent a portion of the stream correspond- 
ing to a CM (commercial) or the like from being 
recorded, the audio video stream corresponding to one 
program is recorded as different VOBs. 
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[0067] By the way, since a digital broadcast signal is 
usually inputted to the recording apparatus as an MPEG 
stream (audio video stream corresponding to the MPEG 
standard), when the digital broadcast signal is received 
and recorded, the video attribute (e.g., video resolution) 
changes from scene to scene during the recording, in 
contract with recording of an analog broadcast signal. 
For example, the resolution in the horizontal direction 
changes from 720 pixels to 312 pixels. 
[0068] Further, when a standard TV broadcast pro- 
gram and a high-vision TV broadcast program are 
recorded continuously, the aspect ratio, which is an 
attribute of the recorded data corresponding to each 
program, changes from 4:3 to 16:9 at the point where 
the program changes. 

[0069] In this case, two audio video streams having 
different attributes are recorded as one VOB, whereby 
management of the video attribute of the stream 
recorded on the recording medium becomes difficult. 
[0070] Further, in the process of playing the audio 
video stream recorded on the recording medium, since 
the video stream (coded data) is decoded with the video 
resolution as a parameter, management of the video 
resolution as the video attribute is insufficient. If the 
change of the video resolution is not posted to the 
decoder, decoding of the video stream results in failure. 
[0071] Furthermore, since the recording position of 
the audio video stream is managed by the address cor- 
responding to each VOBU which is a constituent of the 
VOB, direct access cannot be made to the position 
where the video resolution changes, in the stream 
included in the VOBU. Therefore, random access to the 
beginning of the program, based on the change of the 
video resolution, cannot be performed at high speed. 
[0072] Moreover, although the coded data included 
in the VOBU is divided into data units (packs) each hav- 
ing a predetermined data size (2048 bytes) and each 
pack is managed by the address indicating the record- 
ing position of its head data, the position corresponding 
to the video resolution change point in the stream is not 
always equal to the head position of the pack as the 
access unit and, therefore, the video resolution change 
position in the stream cannot accessed by using the 
address indicating the recording position of the pack. 
[0073] Furthermore, when the video signal is coded 
by the MPEG-2 coding, the coding mode for each frame 
is decided based on a predetermined rule, and coding is 
performed so that a GOP comprising at least one I 
frame, plural P frames, and plural B frames is consti- 
tuted according to the decided coding mode. Therefore, 
when the video resolution changes, the video resolution 
may vary between the frames constituting one GOP, 
and a stream adapted to the MPEG-2 standard cannot 
be generated. 

[0074] Furthermore, in the MPEG coding, the inter- 
frame predictive coding is carried out, that is, the video 
signal of the target frame to be processed is coded with 
reference to the video signal of the reference frame 



which has already been coded. Therefore, when the 
video resolution changes, the video resolution varies 
between the target frame and the reference frame, and 
the inter-frame predictive coding results in failure. 
5 [0075] Also when recording the coded data 
obtained by coding the TV signal whose aspect ratio 
changes, the following drawbacks will occur like in the 
above-described case where the video resolution 
changes. 

10 [0076] That is, when the video aspect ratio 
changes, management of the video attribute of the 
recorded audio video stream (coded data becomes dif- 
ficult. Further, in the playback process, decoding of the 
coded data results in failure due to the change of the 

15 video aspect ratio. Moreover, quick access cannot be 
made to the position where the video aspect ratio 
changes, in the video stream. 

[0077] Moreover, in the MPEG coding, when the 
video aspect ratio changes, a stream adapted to the 
20 MPEG-2 standard cannot be generated, or the inter- 
frame predictive coding results in failure, as in the 
above-described case where the video resolution 
changes. 

[0078] With respect to a video signal having plural 
25 video attributes not only the video resolution and the 
video aspect ratio but also the coding mode and the like, 
access cannot be made to the position where the video 
attribute changes, in the recorded video stream. 
[0079] Further, with respect to an audio video signal 
30 including not only a video signal but also an audio signal 
appended to the video signal, access cannot be made 
to the position where the audio attribute (coding mode, 
number of channels, bit rate, etc.) of the audio signal 
changes, in the recorded data corresponding to this 
35 audio video signal. 

SUMMARY OF THE INVENTION 

[0080] The present invention is made to solve the 

40 above-described problems, and it is an object of the 
present invention to provide a recording apparatus 
which can record coded data corresponding to a TV sig- 
nal in which at least one of the video attribute (coding 
mode, video resolution, video aspect ratio, or the like) 

45 and the audio attribute (coding mode, number of chan- 
nels, bit rate, or the like) changes, so as to enable quick 
access to the position where the attribute changes. 
[0081 ] It is another object of the present invention to 
provide a recording apparatus which can record coded 

so data corresponding to a TV signal in which at least one 
of the video attribute and the audio attribute changes, 
on a recording medium, while managing the coded data 
so as to enable satisfactory decoding of the coded data. 
[0082] It is still another object of the present inven- 

55 tion to provide a coding apparatus which can code a 
video signal in which the video attribute (coding mode, 
video resolution, video aspect ratio, or the like) 
changes, according to a predetermined coding method 
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such as MPEG. 

[0083] Other objects and advantages of the inven- 
tion will become apparent from the detailed description 
that follows. The detailed description and specific 
embodiments described are provided only for illustra- 5 
tion since various additions and modifications within the 
scope of the invention will be apparent to those of skill in 
the art from the detailed description. 
[0084] According to a first aspect of the present 
invention, there is provided a recording apparatus for 10 
recording, on a recording medium, an audio video 
stream which is obtained by coding an audio video sig- 
nal including an audio signal and a video signal. This 
apparatus comprises an attribute detector for detecting 
an attribute relating to at least one of the video signal 15 
and the audio signal on the basis of the audio video 
stream, and outputting attribute data indicating the 
attribute; an information generator for detecting a 
recording position in the audio video stream recorded 
on the recording medium, which recoding position cor- 20 
responds to a point where the attribute changes, or a 
recording time of the audio video stream based on a ref- 
erence time, which recording time corresponds to the 
attribute change point, and outputting attribute change 
information indicating the recording position or the 25 
recording time; and a recorder for recording the attribute 
data and the attribute change information on the record- 
ing medium. Therefore, when playing the audio video 
stream recorded on the recording medium, a portion of 
the audio video stream where the video attribute or the 30 
audio attribute changes can be recognized, whereby 
the portion of the stream where the attribute change 
occurs can be accessed quickly. 

[0085] According to a second aspect of the present 
invention, in the recording apparatus of the first aspect, 35 
the attribute detector detects a video attribute relating to 
the video signal and an audio attribute relating to the 
audio signal, and outputs attribute data indicating each 
attribute; and the recorder records the attribute data 
indicating the video attribute and the attribute data indi- 40 
eating the audio attribute in predetermined recording 
areas of the recording medium, respectively. Therefore, 
when playing the audio video stream, a portion of the 
stream where either the video attribute or the audio 
attribute changes can be accessed quickly. 45 
[0086] According to a third aspect of the present 
invention, in the recording apparatus of the first aspect, 
the attribute detector detects the video resolution of the 
video signal as an attribute relating to the video signal, 
and outputs video resolution data indicating the video so 
resolution; on the basis of the video resolution data, the 
information generator detects a recording position in the 
stream recorded on the recording medium, which 
recording position corresponds to a point where the 
video resolution changes, or a recording time of the 55 
stream based on a reference time, which recording time 
corresponds to the video resolution change point, and 
outputs attribute change information indicating the 



recording position or the recording time; and the 
recorder records the video resolution data and the reso- 
lution change information on the recording medium. 
Therefore, when playing the audio video stream, a por- 
tion of the stream where the video resolution changes 
can be accessed quickly. 

[0087] According to a fourth aspect of the present 
invention, in the recording apparatus of the first aspect, 
the attribute detector detects the aspect ratio of the 
video signal as an attribute relating to the video signal, 
and outputs aspect ratio data indicating the aspect ratio; 
on the basis of the aspect ratio data, the information 
generator detects a recording position in the stream 
recorded on the recording medium, which recording 
position corresponds to a point where the aspect ratio 
changes, or a recording time of the stream based on a 
reference time, which recording time corresponds to the 
aspect ratio change point, and outputs aspect ratio 
change information indicating the recording position or 
the recording time; and the recorder records the aspect 
ratio data and the aspect ratio change information on 
the recording medium. Therefore, when playing the 
audio video stream, a portion of the stream where the 
aspect ratio changes can be accessed quickly. 
[0088] According to a fifth aspect of the present 
invention, there is provided a recording apparatus for 
recording, on a recording medium, an audio video 
stream which is obtained by coding an audio video sig- 
nal including an audio signal and a video signal. This 
apparatus comprises: a packing unit for performing 
packing to divide the audio video stream into plural 
streams corresponding to packs as data units each hav- 
ing a predetermined size, and outputting the audio 
video stream corresponding to each pack as pack data; 
a recorder for recording each pack data on the record- 
ing medium, as an access unit to the recording medium; 
and an attribute detector for detecting an attribute relat- 
ing to at least one of the video signal and the audio sig- 
nal, and outputting attribute data indicating the attribute. 
The packing unit performs the packing such that a posi- 
tion in the audio video stream, where the attribute 
changes, is positioned at the head of the pack. There= 
fore, a portion where the video attribute or the audio 
attribute changes, of the audio video stream whose 
recording addresses are managed in pack units, can be 
recognized. When playing the audio video stream, the 
potion where the attribute change occurs can be 
accessed quickly. 

[0089] According to a sixth aspect of the present 
invention, in the recording apparatus of the fifth aspect, 
the attribute detector detects a video attribute relating to 
the video signal and an audio attribute relating to the 
audio signal on the basis of the audio video stream, and 
outputs video attribute data indicating the video attribute 
and audio attribute data indicating the audio attribute; 
and the recorder records the video attribute data and 
the audio attribute data in predetermined areas of the 
recording medium, respectively. Therefore, when play- 
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ing the audio video stream recorded on the recording 
medium, the video attribute and the audio attribute cor- 
responding to this stream can be identified. 
[0090] According to a seventh aspect of the present 
invention, the recording apparatus of the fifth aspect fur- 5 
ther comprises an information generator for detecting a 
recording position in the audio video stream recorded 
on the recording medium, which recording position cor- 
responds to a point where at least one of the audio 
attribute and the video attribute changes, or a recording 10 
time of the audio video stream based on a reference 
time, which recording time corresponds to the attribute 
change point, and outputting attribute change informa- 
tion indicating the recording position or the recording 
time; and the recorder records the attribute change 15 
information on the recording medium. Therefore, when 
playing the audio video stream recorded on the record- 
ing medium, a portion of the stream where the video 
attribute or the audio attribute changes can be 
accessed quickly by reading the recording position or 20 
the recording time corresponding to the attribute 
change point. 

[0091] According to an eighth aspect of the present 
invention, in the recording apparatus of the fifth aspect, 
the attribute detector detects the video resolution of the 25 
video signal as an attribute relating to the video signal, 
and outputs video resolution data indicating the resolu- 
tion; and the packing unit performs the packing, on the 
basis of the video resolution data, such that a position in 
the video stream, where the video resolution changes, 30 
is positioned at the head of the pack. Therefore, a por- 
tion where the video resolution changes, of the audio 
video stream whose recording addresses are managed 
in pack units, can be recognized. When playing the 
audio video stream, the portion of the stream where the 35 
video resolution changes can be accessed quickly. 
[0092] According to a ninth aspect of the present 
invention, in the recording apparatus of the fifth aspect, 
the attribute detector detects the aspect ratio of the 
video signal as an attribute relating to the video signal, 40 
and outputs aspect ratio data indicating the aspect ratio; 
and the packing unit performs the packing, on the basis 
of the aspect ratio data, such that a position in the video 
stream, where the aspect ratio changes, is positioned at 
the head of the pack. Therefore, a portion where the 45 
aspect ratio changes, of the audio video stream whose 
recording addresses are managed in pack units, can be 
recognized. When playing the audio video stream, the 
portion where the aspect ratio change occurs can be 
accessed quickly. so 
[0093] According to a tenth aspect of the present 
invention, there is provided a recording apparatus for 
recording, on a recording medium, an audio video 
stream which is obtained by coding an audio video sig- 
nal including an audio signal and a video signal. This 55 
apparatus comprises a video object composer for divid- 
ing the audio video stream into plural streams corre- 
sponding to management units for managing the audio 



video stream, and outputting the stream corresponding 
to each management unit as video object data; a 
recorder for recording management information for 
managing each video object data on the recording 
medium; and an attribute detector for detecting an 
attribute relating to at least one of the video signal and 
the audio signal on the basis of the audio video stream, 
and outputting attribute data indicating the attribute. 
When the attribute changes, the video object composer 
divides the audio video stream, on the basis of the 
attribute data, such that a portion of the audio video 
stream before the attribute change point and a portion 
of the audio video stream after the attribute change 
point are output as different video object data, There- 
fore, when playing the audio video stream recorded on 
the recording medium, decoding of each video object 
data is performed on the basis of the video attribute or 
audio attribute which is managed in video object units. 
[0094] According to an eleventh aspect of the 
present invention, in the recording apparatus of the 
tenth aspect, the management information includes 
information relating to the recording position of each 
video object data on the recording medium, or the 
recording time of each video object data based on a ref- 
erence time. Therefore, the audio video stream 
recorded on the recording medium can be managed in 
video object units. 

[0095] According to a twelfth aspect of the present 
invention, in the recording apparatus of the tenth 
aspect, the attribute detector detects a video attribute 
relating to the video signal and an audio attribute relat- 
ing to the audio signal on the basis of the audio video 
stream, and outputs video attribute data indicating the 
video attribute and audio attribute data indicating the 
audio attribute; and the management information 
includes video attribute information indicating the video 
attribute and audio attribute information indicating the 
audio attribute. Therefore, the attribute of the audio 
video stream recorded on the recording medium can be 
managed in video object units. 

[0096] According to a thirteenth aspect of the 
present invention, in the recording apparatus of the 
tenth aspect, the attribute detector detects the video 
resolution of the video signal as an attribute relating to 
the video signal, and outputs video resolution data indi- 
cating the resolution; and when the video resolution 
changes, the video object composer divides the video 
stream, on the basis of the video resolution data, such 
that a portion of the video stream before the video reso- 
lution change point and a portion of the video stream 
after the video resolution change point are output as dif- 
ferent video object data. Therefore, the video attribute 
can be unified with respect to one video object data. 
Thereby, when playing the audio video stream recorded 
on the recording medium, decoding of each video object 
data can be performed on the basis of the video resolu- 
tion which is managed in video object units. 
[0097] According to a fourteenth aspect of the 
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present invention, in the recording apparatus of the 
tenth aspect, the attribute detector detects the aspect 
ratio of the video signal as an attribute relating to the 
video signal, and outputs aspect ratio data indicating 
the aspect ratio; and when the aspect ration changes, 5 
the video object composer divides the video stream, on 
the basis of the aspect ratio data, such that a portion of 
the video stream before the aspect ratio change point 
and a portion of the video stream after the aspect ratio 
change point are output as different video object data. 10 
Therefore, the aspect ratio can be unified with respect to 
one video object data. Thereby, when playing the audio 
video stream recorded on the recording medium, 
decoding of each video object data can be performed 
on the basis of the aspect ratio which is managed in 75 
video object units. 

[0098] According to a fifteenth aspect of the present 
invention, a coding apparatus for coding a video signal 
comprises a video encoder for subjecting the video sig- 
nal to intra-frame coding or inter-frame coding such that 20 
a group of frames including at least one frame subjected 
to the intra-frame coding is generated, and a video 
stream corresponding to the frame group is output as a 
stream unit that can be accessed randomly; and a video 
attribute detector for detecting a video attribute of the 25 
video signal, and outputting video attribute data indicat- 
ing the video attribute. The video encoder forms the 
frame group such that a specific frame, whose video 
attribute is different from that of a frame positioned just 
before it, is stored as a head frame in the frame group. 30 
Therefore, when playing the video stream recorded on 
the recording medium with each frame group as an 
access unit, a portion of the video stream where the 
video attribute changes can be accessed quickly. 
[0099] According to a sixteenth aspect of the 35 
present invention, the coding apparatus of the fifteenth 
aspect further comprises an audio attribute detector for 
detecting an audio attribute of an audio signal 
appended to the video signal, and outputting audio 
attribute data indicating the audio attribute; and the 40 
video encoder forms the frame group such that a spe- 
cific frame, whose video or audio attribute is different 
from that of a frame positioned just before it, is stored as 
a head frame in the frame group. Therefore, when play- 
ing the video stream recorded on the recording medium 45 
with each frame group as an access unit, a portion of 
the video stream corresponding to the attribute change 
point can be accessed quickly. 

[0100] According to a seventeenth aspect of the 
present invention, the coding apparatus of the sixteenth 50 
aspect further comprises a packing unit for performing 
packing to divide the video stream into plural streams 
corresponding to packs as data units each having a pre- 
determined size, and outputting the stream correspond- 
ing to each pack as pack data; and the packing unit 55 
performs the packing such that a position in the video 
stream, where the video or audio attribute changes, is 
positioned at the head of the pack. Therefore, when 



playing the video stream recorded on the recording 
medium while managing the addresses in pack units, a 
portion of the stream corresponding to the attribute 
change point can be accessed quickly. 
[0101] According to an eighteenth aspect of the 
present invention, in the coding apparatus of the fif- 
teenth aspect, the video encoder performs coding on 
each frame in a specific frame group including the spe- 
cific frame, without referring to the video signal corre- 
sponding to frames in a frame group which has been 
coded previously to the specific frame group. Therefore, 
the video stream corresponding to the specific frame 
group can be played randomly. That is, coding of the 
specific frame group is performed without referring to 
the video signal of an already processed frame group 
whose video attribute (video resolution) is different from 
that of the specific frame group. Therefore, when a tar- 
get frame to be coded is subjected to inter-frame predic- 
tion coding, only a frame whose video attribute (video 
resolution) is identical to that of the target frame is used 
as a reference frame, whereby decoding of the video 
stream is facilitated. Further, when the target frame is 
subjected to inter-frame prediction coding, only a frame 
whose aspect ratio is identical to that of the target frame 
is used as a reference frame, whereby the target frame 
is coded efficiently. 

[0102] According to a nineteenth aspect of the 
present invention, in the coding apparatus of the fif- 
teenth aspect, the video attribute detector detects the 
video resolution of the video signal as a video attribute 
on the basis of the video signal, and outputs video res- 
olution data indicating the video resolution; and the 
video encoder forms the frame group, on the basis of 
the video resolution data, such that a specific frame, 
whose video resolution is different from that of a frame 
positioned just before it, is stored as a head frame in the 
frame group. Therefore, when playing the video stream 
recorded on the recording medium with each frame 
group as an access unit, a portion of the video stream 
where the video resolution changes can be accessed 
quickly. 

[0103] According to a twentieth aspect of the 
present invention, the coding apparatus of the nine- 
teenth aspect further comprises a packing unit for per- 
forming packing to divide the video stream into plural 
streams corresponding to packs as data units each hav- 
ing a predetermined size, and outputting the stream cor- 
responding to each pack as pack data; and the packing 
unit performs the packing such that a position in the 
video stream, where the video resolution changes, is 
positioned at the head of the pack. Therefore, when 
playing the video stream recorded on the recording 
medium while managing the addresses in pack units, a 
portion of the stream corresponding to the resolution 
change point can be accessed quickly. 
[0104] According to a twenty-first aspect of the 
present invention, in the coding apparatus of the fif- 
teenth aspect, the attribute detector detects the aspect 
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ratio of the video signal as the video attribute on the 
basis of the video signal, and outputs aspect ratio data 
indicating the aspect ratio; and the video encoder forms 
the frame group, on the basis of the aspect ratio data, 
such that a specific frame, whose aspect ratio is differ- 5 
ent from that of a frame positioned just before it, is 
stored as a head frame in the frame group. Therefore, 
when playing the video stream recorded on the record- 
ing medium with each frame group as an access unit, a 
portion of the video stream where the aspect ratio 
changes can be accessed quickly. 
[0105] According to a twenty-second aspect of the 
present invention, the coding apparatus of the twenty- 
first aspect further comprises a packing unit for perform- 
ing packing to divide the video stream into plural 
streams corresponding to packs as data units each hav- 
ing a predetermined size, and outputting the stream cor- 
responding to each pack as pack data; and the packing 
unit performs the packing such that a position in the 
video stream, where the aspect ratio changes, is posi- 
tioned at the head of the pack. Therefore, when playing 
the video stream recorded on the recording medium 
while managing the addresses in pack units, a portion of 
the stream corresponding to the aspect ratio change 
point can be accessed quickly. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0106] 

Figure 1 is a block diagram for explaining a record- 
ing apparatus according to a first embodiment of 
the present invention. 

Figure 2 is a block diagram for explaining a record- 
ing apparatus according to a second embodiment 
of the present invention. 

Figure 3 is a block diagram for explaining a record- 
ing apparatus according to a third embodiment of 
the present invention. 

Figure 4 is a block diagram for explaining a record- 
ing apparatus according to a fourth embodiment of 
the present invention. 

Figure 5 is a block diagram for explaining a record- 
ing apparatus according to a fifth embodiment of 
the present, invention. 

Figure 6 is a block diagram for explaining a record- 
ing apparatus according to a sixth embodiment of 
the present invention. 

Figure 7 is a block diagram for explaining a record- 
ing apparatus according to a seventh embodiment 
of the present invention. 

Figure 8 is a block diagram for explaining a record- 
ing apparatus according to an eighth embodiment 
of the present invention. 

Figure 9 is a block diagram for explaining a record- 
ing apparatus according to a ninth embodiment of 
the present invention. 

Figure 1 0 is a block diagram for explaining a coding 



apparatus according to a tenth embodiment of the 
present invention. 

Figure 11 is a block diagram for explaining a coding 
apparatus according to an eleventh embodiment of 
the present invention. 

Figure 1 2 is a block diagram for explaining a coding 
apparatus according to a twelfth embodiment of the 
present invention. 

Figure 13 is a schematic diagram for explaining the 
operation of the coding apparatus according to the 
twelfth embodiment, illustrating plural frames con- 
stituting one GOP. 

Figure 1 4 is a block diagram for explaining a coding 
apparatus according to a thirteenth embodiment of 
the present invention. 

Figure 1 5 is a block diagram for explaining a coding 
apparatus according to a fourteenth embodiment of 
the present invention. 

Figure 1 6 is a block diagram for explaining a coding 
apparatus according to a fifteenth embodiment of 
the present invention. 

Figure 17 is a diagram illustrating the structure of 
one GOP based on MPEG coding, and the struc- 
ture of coded data corresponding to the GOP. 
Figure 18 is a diagram for explaining a recording 
format employed when recording an audio video 
stream on a recording medium, and illustrates 
video attribute information in detail. 
Figure 19 is a diagram for explaining another 
recording format employed when recording an 
audio video stream on a recording medium, and 
illustrates video attribute information in detail. 
Figure 20 is a diagram for explaining a recording 
format employed when recording an audio video 
stream on a recording medium, and illustrates 
audio attribute information in detail. 
Figures 21(a) and 21 (b) are diagrams illustrating a 
format of a video object (21(a)) employed when 
recording an audio video stream on a recording 
medium, and video packets (21(b)) constituting the 
video object. 

Figures 22(a) and 22(b) are diagrams illustrating a 
format of a video object (22(a)) employed when 
recording an audio video stream on a recording 
medium, and video packets (22(b)) constituting the 
video object. 

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

[Embodiment 1] 

[0107] Figure 1 is a block diagram for explaining a 
recording apparatus 100a according to a first embodi- 
ment of the present invention. 

[0108] The recording apparatus 100a includes a 
broadcast receiver 102 and a VOB composer 103. The 
broadcast receiver receives a digital broadcast signal 
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Sbr, and outputs an audio video stream Sav. The VOB 
composer 1 03 converts the audio video stream Sav into 
recordable data according to a format of a video stand- 
ard, and outputs VOB data Dvob. 

[0109] The broadcast receiver 102 has a TS/PS 5 
conversion function for converting a transmission format 
stream (transport stream) into a record format stream 
(program stream), and the audio video stream Sav out- 
putted from the broadcast receiver 102 is a program 
stream. Further, a packing unit (not shown) is provided 10 
between the broadcast receiver 1 02 arid the VOB com- 
poser 103. The packing unit divides the audio video 
stream Sav into plural streams corresponding to first 
data units each having a predetermined size (2Kbyte). 
The packing unit outputs data (PCK data) correspond- 15 
ing to each data unit (pack) to the VOB composer 103. 
Further, this packing unit outputs, as the PCK data, 
audio packs containing audio streams (A_PCK) arid 
video packets containing video streams (V_PCK) . 
Although the PCK data is fixed- length data of 2Kbyte, it 20 
may be variable-length data. In this case, the data size 
is written in the header of each pack. Further, as shown 
in figures 21 (a)&(b) and 22(a)&(b), the VOB composer 
103 forms a third data unit (VOBU) comprising plural 
packs on the basis of the PCK data supplied from the 25 
packing unit, and further forms a second data unit 
(VOB) comprising plural VOBUs, and outputs the data 
corresponding to the VOB as the above-described VOB 
data Dvob. 

[0110] The recording apparatus 100a further so 
includes a video attribute detector 107 and an audio 
attribute detector 109. The video attribute detector 107 
receives the audio video stream Sav, detects a video 
attribute or the video stream included in the audio video 
stream Sav by analyzing header information of the 35 
video stream, and outputs data indicating the video 
attribute (video attribute data Dva) . The audio attribute 
detector 109 receives the audio video stream Sav, 
detects an audio attribute of the audio stream included 
in the audio video stream Sav by analyzing header infor- 40 
mation of the audio stream, and outputs data indicating 
the audio attribute (audio attribute data Daa). 
[0111] The video attribute is, for example, any of the 
coding mode (e.g., MPEG-1 coding or MPEG-2 coding), 
horizontal frame size, the vertical frame size, frame fre- 45 
quency, pixel sampling frequency, chrominance signal 
component format, aspect ratio, letter box, bit rate, 
scanning mode (interlace or progressive), etc. The 
video attribute detector 1 07 detects at least one of these 
attributes. The audio attribute is, for example, any of the 50 
coding mode, number of channels, sampling frequency, 
number of quantized bits, dynamic range control, bit 
rate, etc. The audio attribute detector 109 detects at 
least one of these attributes. 

[0112] The recording apparatus 100a further 55 
includes a system controller 111, an address informa- 
tion generator 1 08, and a VMG generator 1 1 0. The sys- 
tem controller 1 1 1 generates write address information 



Iwa which indicates a position (write address) where 
data is to be written in the recording area of the record- 
ing medium 106. The address information generator 
108 outputs specific address information Isa which indi- 
cates the write address at the time when the attribute 
changes, on the basis of the video attribute data Dva, 
the audio attribute data Daa, and the address informa- 
tion Iwa. The VMG generator 1 1 0 generates VMG infor- 
mation Dvmg which is management information for 
each VOB, on the basis of the video attribute data Dva, 
the audio attribute data Daa, and the specific address 
information Isa. When the video attribute or audio 
attribute of the audio video stream Sav changes, the 
VMG generator 1 1 0 outputs the VMG information Dvmg 
including the specific address information Isa which 
indicates the write address at the time when the 
attribute change occurs. 

[0113] The recording apparatus 100a further 
includes a recording signal processor 1 1 0 and a record- 
ing head 105. The recording signal processor 110 sub- 
jects the VOB data Dvob from the VOB composer 103 
and the VMG information Dvmg from the VMG genera- 
tor 110 to addition of error correcting code and record- 
ing modulation, and outputs a recording signal Sre 
corresponding to the VOB data. The recording head 
105 writes the recording signal Sre on the recording 
medium 106 such as an optical disk. 
[0114] In figure 1, reference numeral 101 denotes 
an antenna for receiving the digital broadcast signal, 
and the output from the antenna 1 01 is received by the 
broadcast receiver 102. 

[0115] Hereinafter, the operation of the recording 
apparatus 100a will be described. 
[0116] The broadcast signal Sbr from the antenna 
101 is received by the broadcast receiver 102, wherein 
the broadcast signal Sbr is subjected to predetermined 
processing to extract the audio video stream Sav. This 
audio video stream Sav is output to the packing unit (not 
shown), the video attribute detector 107, and the audio 
attribute detector 109. The audio video stream Sav is 
output as a program stream by the TS/PS conversion 
function of the broadcast receiver 102. 
[0117] In the packing unit, the audio video stream 
Sav is packed as described above, and the stream Sav 
corresponding to each pack is output as PCK data to 
the VOB composer 1 03. 

[0118] In the VOB composer 103, the audio video 
stream (PCK data) Sav is converted to VOB data Dvob 
(see figures 21 (a)&(b) and 22(a)&(b)) according to the 
format of the DVD recording standard, and then the 
VOB data Dvob is output to the recording signal proces- 
sor 104. 

[0119] In the recording signal processor 104, the 
VOB data Dvob is subjected to addition of error correct- 
ing code and recording modulation, thereby generating 
a recording signal Sre corresponding to the VOB data 
Dvob. This recording signal Sre is written in a predeter- 
mined recording area of the optical disk (recording 
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medium) 106 by the recording head 105 according to 
the write address information Iwa generated by the sys- 
tem controller 111. 

[0120] In the video attribute detector 107, a video 
attribute of the video stream included in the audio video 5 
stream Sav is detected by analyzing the header infor- 
mation or the video stream, and video attribute data Dva 
indicating the video attribute is output to the address 
information generator 1 08 and the VMG generator 1 1 0. 
[0121] On the other hand, in the audio attribute 10 
detector 1 09, an audio attribute of the audio stream 
included in the audio video stream Sav is detected by 
analyzing the header information of the audio stream, 
and audio attribute data Daa indicating the audio 
attribute is output to the address information generator 15 

1 08 and the VMG generator 1 1 0. 

[0122] In the address information generator 108, 
when at least one of the video attribute and the audio 
attribute changes, specific address information Isa indi- 
cating the write address at the time when the attribute 20 
change occurs is generated, on the basis of the write 
address information Iwa, the video attribute data Dva, 
and the audio attribute data Daa. The specific address 
information Isa is output to the VMG generator 1 1 0. 
[0123] In the VMG generator 1 1 0, VMG information 25 
Dvmg, which is management information for each VOB, 
is generated on the basis of the video attribute informa- 
tion Dva, the audio attribute information Daa, and the 
specific address information Isa. This VMG information 
Dvmg is subjected to addition of error correcting code 30 
and recording modulation by the recording signal proc- 
essor 104, whereby a recording signal Sre correspond- 
ing to the VMG information Dvmg is generated. This 
recording signal Sre is written in the optical disk 106 by 
the recording head 1 05. The recording signal Sre corre- 35 
sponding to the VMG information Dvmg is written in a 
predetermined recording area of the recording medium 
106 after recording of the recording signal Sre corre- 
sponding to the VOB data Dvob is completed. At this 
time, the specific address information Isa is written in 40 
the recording area corresponding to the VMGI 10b1 
(see figure 19) . Further, the video attribute data Dva 
and the audio attribute data Daa are written in the 
V_ATR 10d1 and the A_ATR 10d2 in the VMG informa- 
tion 10a, respectively (see figure 19) . 45 
[0124] As described above, the recording appara- 
tus 1 00a according to the first embodiment of the inven- 
tion is provided with the video attribute detector 1 07 for 
detecting the video attribute, the audio attribute detector 

109 for detecting the audio attribute, and the address 50 
information generator 108 for generating the specific 
address information Isa which indicates the write 
address at the time when the attribute change occurs, 

on the basis of the outputs from these detectors 107 
and 109. The specific address information Isa is written 55 
in the recording area of the optical disk 1 06 correspond- 
ing to the VMG I 1 0b1 which is a component of the man- 
agement information (VMG information) 10a of each 



VOB. Therefore, when playing the audio video stream 
recorded on the optical disk 106, the recording position 
in the audio video stream, at which the attribute change 
occurs, can be recognized by reading the VMG informa- 
tion 10a, whereby quick access can be made to the 
recording position in the stream where the attribute 
change occurs on the optical disk. 
[0125] While in this first embodiment the specific 
address information Isa indicating the data write 
address at the point of time when the video attribute or 
the audio attribute changes is written on the recording 
medium, information relating to the recording time 
based on a reference time, at which the video attribute 
or the audio attribute changes, may be recorded on the 
recording medium instead of the specific address infor- 
mation Isa. 

[Embodiment 2] 

[0126] Figure 2 is a block diagram for explaining a 
recording apparatus 100b according to a second 
embodiment of the present invention. 
[01 27] The recording apparatus 1 00b of this second 
embodiment includes a packing unit 1 12 and a padding 
size generator 113, instead of the packing unit of the 
first embodiment. The packing unit 112 performs the 
same packing process as described for the first embod- 
iment. Further, when the audio attribute or the video 
attribute changes, the packing unit 1 12 inserts padding 
data in the pack which is being formed, following the 
stream just before the attribute change, to prevent the 
stream before the attribute change and the stream after 
the attribute change from being stored in one pack. The 
padding size generator 113 calculates the size of the 
padding data on the basis of the video attribute data 
Dva, the audio attribute data Daa, and an internal signal 
Sop of the packing unit 1 12, and outputs size informa- 
tion Dps indicating the size to the packing unit 1 12. The 
padding data is also called stuffing data. 
[0128] Other constituents of the recording appara- 
tus 1 00b are identical to those already described, for the 
recording apparatus 100a of the first embodiment. 
[0129] Reference numeral 103b denotes a VOB 
composer according to this second embodiment, and 
the VOB composer 1 03b converts the PCK data Dpck 
from the packing unit 112 into VOB data Dvob according 
to the format of the video standard. 
[0130] Next, the operation of the recording appara- 
tus 100b will be described. 

[0131] The audio video stream Sav supplied from 
the broadcast receiver 1 02 is divided into plural streams 
corresponding to data units (first data units) each hav- 
ing a predetermined data size by the packing unit 112, 
and PCK data Dpck corresponding to each stream 
(pack) is output to the VOB composer 1 03b. At this time, 
the audio stream is output as audio PCK data (hereinaf- 
ter also referred to simply as "audio pack"), and the 
video stream is output as video PCK data (hereinafter 



12 



23 



EP 1 081 967 A2 



24 



also referred to simply as "video pack"). 
[0132] In the VOB composer 103, a predetermined 
number of audio packs and video packs equivalent to a 
predetermined display time are put together as a VOBU 
(third data unit). Then, VOB data Dvob, which com- 
prises a plurality of VOBUs equivalent to the audio video 
stream Sav inputted from the start of recording to the 
end of recording, is generated. 

[0133] The VOB data Dvob is converted into a 
recording signal Sre by the recording signal processor 
1 04, and recorded on the recording medium 1 06 by the 
optical head 1 05 according to the write address Iwa. 
[0134] Further, the video attribute detector 107 
detects the video attribute to output the video attribute 
data Dva, and the audio attribute detector 109 detects 
the audio attribute and to output the audio attribute data 
Daa. 

[0135] In the address information generator 108, 
specific address information Isa indicating the write 
address at the time when the audio attribute or the video 
attribute changes is generated on the basis of the video 
attribute data Dva, the audio attribute data Daa, and the 
write address information iwa. 

[0136] The padding size generator 1 1 3 receives the 
video attribute data Dva, the audio attribute data Daa, 
and the internal signal Sop indicating the packing state 
in the packing unit 112. When the video attribute or the 
audio attribute changes, the padding size generator 1 13 
calculates the size of padding data, and outputs the size 
information Dps indicating the data size. To be specific, 
when the packing unit 112 detects an attribute change 
while storing the stream (video stream or audio stream) 
in a specific pack, the padding size generator 113 calcu- 
lates the vacant space of this pack at this point of time, 
as the size of padding data. 

[0137] When the data size information Dps is input 
to the packing unit 1 12, padding data is inserted in the 
vacant space of the specific pack, by an amount indi- 
cated by the size information Dps, instead of a portion of 
the stream after the position where the attribute change 
occurs. Thereby, even when the attribute change occurs 
during formation of the specific pack, the stream after 
the attribute change is not stored in this pack together 
with the stream before the attribute change. In other 
words, when the attribute change occurs, the stream 
before the attribute change and the stream after the 
attribute change are stored in adjacent different packs, 
respectively. To be specific, the stream after the 
attribute change is stored, from its beginning, in a pack 
which follows the specific pack. 

[0138] Thereafter, the same processes as those 
already described for the first embodiment are carried 
out and, finally, the recording signal Sre corresponding 
to the VMG information Dvmg of each VOB is recorded 
on the recording medium 106. 

[0139] As described above, the recording appara- 
tus 100b of this second embodiment is provided with, 
instead of the packing unit of the first embodiment, the 



packing unit 112 for inserting padding data in the pack 
currently being formed when the audio attribute or the 
video attribute changes, and the padding size generator 
113 which calculates the size or the padding data. In 

5 this recording apparatus 100b, when the attribute 
changes, padding data is inserted in the pack under for- 
mation, instead of the stream after the attribute change. 
Therefore, the stream is recorded such that the head of 
the stream after the attribute change is matched to the 

10 head of the pack. 

[0140] Accordingly, when playing the audio video 
stream recorded on the optical disk 1 06, it is possible to 
recognize the recording position corresponding to the 
portion where the video or audio attribute changes, in 

15 the audio video stream whose recording addresses are 
managed in pack units, by reading the VMG information 
1 0a, whereby quick access can be made to the record- 
ing position of the stream where the attribute change 
occurs, on the optical disk. 

20 [0141] While in this second embodiment the prede- 
termined position of the stream and the head position of 
the pack are matched to each other by inserting the 
padding data in the pack, the method of matching the 
predetermined position of the stream to the head posi- 

25 tion of the pack is not restricted thereto. For example, 
the size information of the pack data written in the pack 
header or the like may be changed. 
[0142] Further, in the first and second embodi- 
ments, both of the video attribute and the audio attribute 

30 are detected, and the specific address information Isa is 
generated according to changes in these attributes. 
However, only one of the video attribute and the audio 
attribute may be detected, and the specific address 
information Isa may be generated according to a 

35 change in the detected attribute. 

[Embodiment 3] 

[0143] Figure 3 is a block diagram for explaining a 
40 recording apparatus 100c according to a third embodi- 
ment of the present invention. 

[0144] This recording apparatus 100c includes a 
VMG generator 1 1 0c instead of the address information 
generator 108 and the VMG generator 110 of the first 

45 embodiment. The VMG generator 1 1 0c generates VMG 
information Dvmg, which is management information for 
each VOB, on the basis of the video attribute data Dva 
and the audio attribute data Daa supplied from the 
attribute detectors 107 and 109, respectively. 

so [0145] The recording apparatus 100c further 
includes a VOB composer 103c instead of the VOB 
composer 1 03 of the first embodiment. When either the 
audio attribute or the video attribute changes, the VOB 
composer 103c generates VOB data so that the stream 

55 before the attribute change and the stream after the 
attribute change are separated as different VOB data, 
on the basis of the video attribute data Dva and the 
audio attribute data Daa. 
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[0146] Other constituents of the recording appara- 
tus 1 00c are identical to those already described for the 
recording apparatus 1 00a of the first embodiment. 
[0147] Hereinafter, the operation of the recording 
apparatus 100c will be described. 
[0148] In this third embodiment, the broadcast sig- 
nal Sbr from the antenna 101 is received by the broad- 
cast receiver 1 02, and the audio video stream Sav from 
the receiver 1 02 is input to the VOB composer 1 03c, the 
video attribute detector 107, and the audio attribute 
detector 109. In the VOB composer 103c, the audio 
video stream Sav is converted to VOB data Dvob (see 
figures 21 (a)&(b) and 22(a)&(b)) according to the format 
of the DVD recording standard. This VOB data Dvob is 
converted to a recording signal Sre by the recording sig- 
nal processor 104, and the recording signal Sre is writ- 
ten in the optical disk 106 by the recording head 105. 
[0149] Further, in the video attribute detector 107, 
the video attribute is detected on the basis of the audio 
video stream Sav, and the video attribute data Dva is 
output to the VOB composer 1 03c and the VMG gener- 
ator 110. In the audio attribute detector 109, the audio 
attribute is detected on the basis of the audio video 
stream Sav, and the audio attribute data Daa is output to 
the VOB composer 1 03c and the VMG generator 1 1 0. 
[0150] The receiver 102, the VOB composer 103c, 
the recording signal processor 104, the recording head 
1 05, and the attribute detectors 1 07 and 1 09 operate in 
the same way as described for the first embodiment. 
[0151] In this third embodiment, when the video 
attribute orthe audio attribute changes during recording 
of the audio video stream Sav, the VOB composer 1 03c 
forms VOB data corresponding to the audio video 
stream Sav such that the stream before the attribute 
change and the stream after the attribute change are 
separated as different VOB data. 

[0152] In the VMG generator 110c, VMG informa- 
tion Dvmg as management information for recorded 
VOB data is generated, and the VMG information Dvmg 
is output to the recording signal processor 104. In the 
processor 104, the VMG information Dvmg is subjected 
to addition of error correcting code and to recording 
modulation, thereby generating a recording signal Sre 
corresponding to the VMG information Dvmg. The 
recording signal Sre is recorded on the optical disk 106 
by the optical head 105. 

[0153] As shown in figure 18, the VMG information 
Dvmg includes, as management information for each 
VOB, recording start address information, search infor- 
mation, video attribute information, and audio attribute 
information. The search information is information for 
searching a predetermined position in the recorded 
audio video stream. 

[0154] As described above, the recording appara- 
tus 100c of this third embodiment is provided with the 
VOB composer 103c which generates VOB data such 
that, when either the audio attribute or the video 
attribute changes, the stream before the attribute 



change and the stream after the attribute change are 
separated as different VOB data. Therefore, the video 
attribute and the audio attribute can be unified for one 
VOB data. Accordingly, when playing the signal 

5 recorded on the optical disk 1 06, each VOB data can be 
decoded on the basis of the video attribute and the 
audio attribute which are managed in VOB units. 
[0155] Further, when playing the audio video 
stream, quick access can be made to the recording 

10 position where the attribute change occurs, in the 
stream recorded on the optical disk, by reading the 
VMG information Dvmg. 

[01 56] Further, in this third embodiment, both of the 
video attribute and the audio attribute are detected, and 
15 the VOB data is generated according to changes in 
these attributes. However, only one of the video 
attribute and the audio attribute may be detected, and 
the VOB data may be generated according to a change 
in the detected attribute. 

20 

[Embodiment 4] 

[0157] Figure 4 is a block diagram for explaining a 
recording apparatus 100d according to a fourth embod- 

25 iment of the present invention. 

[01 58] Like the recording apparatus 1 00a of the first 
embodiment, the recording apparatus 100d receives a 
digital broadcast signal Sbr, and records an audio video 
stream Sav obtained from the digital broadcast signal 

30 Sbr on a recording medium according to the format of 
the DVD recording standard. 

[0159] To be specific, the recording apparatus 100d 
includes a broadcast receiver 1 02 for receiving a digital 
broadcast signal Sbr from an antenna 101, and output- 

35 ting an audio video stream Sav; a VOB composer 103 
for converting the audio video stream Sav into VOB data 
Dvob according to the format of the DVD recording 
standard; a recording signal processor 1 10 for subject- 
ing the VOB data Dvob to addition of error correcting 

40 code and to recording modulation, and outputting a 
recording signal Sre corresponding to the VOB data; 
and a recording head 105 for writing the recording sig- 
nal Sre on a recording medium 106. Further, in the 
recording apparatus 100d, a packing unit (not shown) 

45 for packing the audio video stream Sav is provided 
between the broadcast receiver 102 and the VOB com- 
poser 1 03, and the VOB composer 1 03 is supplied with 
PCK data as the audio video stream Sav. 
[0160] The antenna 101, the receiver 102, the VOB 

so composer 103, the recording signal processor 104, the 
recording head 1 05, the recording medium 1 06, and the 
packing unit (not shown) are identical to those 
described for the first embodiment. 
[0161] The recording apparatus 100d further 

55 includes a video resolution detector 107d for receiving 
the audio video stream Sav, detecting the video resolu- 
tion by analyzing the header information of the video 
stream included in the audio video stream Sav, and out- 
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putting data Dvr indicating the video resolution (video 
resolution data) . 

[0162] The video resolution detector 107d detects, 
as the video resolution, at least one of the horizontal 
resolution, vertical resolution, and time resolution 
(frame frequency) . 

[0163] The horizontal resolution indicates the frame 
size in the horizontal resolution, and information such 
as 352, 480, 544, 704, 720, 1 440, or 1 920 pixels is used 
as the frame size. 

[0164] The vertical resolution indicates the frame 
size in the vertical resolution, and information such as 
240, 480, 576, 720, or 1080 lines is used as the frame 
size. 

[0165] The time resolution is indicated by the frame 
frequency, and a value such as 24, 30, 50, 59.97, or 
60Hz is used as the value of the frame frequency. 
[0166] The video resolution detector 107d may 
detect, as the time resolution, that the audio video 
stream Sav corresponds to either an interlace signal or 
a progressive signal. 

[0167] Further, the recording apparatus 100d 
includes a system controller 111 for generating write 
address information Iwa which indicates an address of a 
recording area in the recording medium 106 where the 
recording signal is to be written; an address information 
generator 108d for outputting specific address informa- 
tion Isa indicating the write address at the time when the 
video resolution changes, on the basis of the video res- 
olution data Dvr and the write address information Iwa; 
and a VMG generator 110d for generating VMG infor- 
mation Dvmg that is management information for the 
VOB data, on the basis of the video resolution data Dvr 
from the detector 107d and the specific address infor- 
mation Isa from the address information generator 108. 
[0168] When the video resolution of the audio video 
stream Sav changes, the specific address information 
Isa indicating the write address at the time when the 
resolution change occurs, is stored in the VMG informa- 
tion Dvmg, as described for the first embodiment. 
[0169] Next, the operation of the recording appara- 
tus lOOd will be described. 

[0170] In this fourth embodiment, the broadcast sig- 
nal Sbrfrom the antenna 101 is received by the broad- 
cast receiver 1 02, and the audio video stream Sav from 
the receiver 102 is input to the VOB composer 103 
through the packing unit (not shown) . In the VOB com- 
poser 103, the audio video stream Sav is converted to 
VOB data Dvob (see figures 21(a)&(b) and 22(a)&(b)) 
according to the format of the DVD recording standard. 
This VOB data Dvob is converted to a recording signal 
Sre by the recording signal processor 104, and the 
recording signal Sre is written in the optical disk 106 by 
the recording head 105. 

[0171] The receiver 102, the VOB composer 103, 
the recording signal processor 104, the recording head 
105, and the packing unit (not shown) operate in the 
same way as described for the first embodiment. 



[0172] In the video resolution detector 107d, the 
video resolution is detected on the basis of the audio 
video stream Sav from the receiver 1 02, and video res- 
olution data Dvr is output to the address information 

5 generator 1 08d and the VMG generator 1 1 0d. 

[0173] Further, in the system controller 111, the 
write address information Iwa for the data is generated. 
In the address information generator 108d, when the 
video resolution changes, the specific address informa- 

10 tion Isa indicating the write address at the time when the 
video the resolution change occurs, is output to the 
VMG generator 1 1 0d, on the basis of the output from 
the detector 107d and the write address information 
Iwa. 

15 [0174] In the VMG generator 1 10d, the VMG infor- 
mation Dvmg as management information for each VOB 
is generated on the basis of the video resolution data 
Dvr and the output from the address information gener- 
ator 108d. This VMG information Dvmg is subjected to 

20 addition of error correcting code and to recording mod- 
ulation by the recording signal processor 104, thereby 
generating a recording signal Sre corresponding to the 
VMG information Dvmg. This recording signal Sre is 
written in the optical disk 106 by the recording head 

25 1 05. 

[0175] The recording signal Sre corresponding to 
the VMG information Dvmg is written in a predeter- 
mined recording area of the recording medium 1 06 after 
recording of the recording signal Sre corresponding to 

30 the VOB data Dvob is completed. At this time, the spe- 
cific address information Isa is written as search infor- 
mation in the recording area corresponding to the VMGI 
10b1 (see figure 19). Further, the video resolution data 
Dvr is recorded in the V_ATR 10d1 in the VMG informa- 

35 tion 10a (see figure 19). 

[0176] As described above, the recording appara- 
tus 100d according to the fourth embodiment of the 
invention is provided with the video resolution detector 
1 07d for detecting the video resolution, and the address 

40 information generator 108d for generating the specific 
address information Isa which indicates the data write 
address at the time when the video resolution changes, 
on the basis of the output from the detector 107d. The 
specific address information Isa is stored in the VMGI 

45 10b1 which is a component of the management infor- 
mation (VMG information) 1 0a of each VOB. Therefore, 
when playing the audio video stream recorded on the 
optical disk 106, the recording position corresponding to 
the portion of the audio video stream where the video 

so resolution changes can be recognized by reading the 
VMG information 1 0a, and this enables quick access to 
the position in the recorded stream where the video res- 
olution changes. 



[0177] Figure 5 is a block diagram for explaining a 
recording apparatus 100e according to a fifth embodi- 
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merit of the present invention. 

[0178] The recording apparatus 100e of this fifth 
embodiment includes a packing unit 112 and a padding 
size generator 113, instead of the packing unit (not 
shown) of the recording apparatus 100d according to 5 
the fourth embodiment. The packing unit 112 performs 
the same packing process as described for the first 
embodiment. Further, when the video resolution 
changes, the packing unit 112 inserts padding data in 
the pack which is being formed at this time, following the 10 
stream just before the resolution change, to prevent the 
stream before the resolution change and the stream 
after the resolution change from being stored in one 
pack. The padding size generator 113 calculates the 
size of the padding data on the basis of the video reso- 15 
lution data Dvr and an internal signal Sop from the pack- 
ing unit 1 12, and outputs size information Dsp indicating 
this size to the packing unit 1 1 2. 
[0179] Other constituents of the recording appara- 
tus 1 0Oe are identical to those already described for the 20 
recording apparatus 100d of the fourth embodiment. 
[0180] Reference numeral 103b denotes a VOB 
composer according to this fifth embodiment, and this 
VOB composer 103b converts the PCK data Dpck from 
the packing unit 112 into VOB data Dvob according to 25 
the format of the DVD recording standard, like the VOB 
composer 1 03 according to the fourth embodiment. 
[0181] Next, the operation of the recording appara- 
tus 100e will be described. 

[0182] When the audio video stream Sav supplied 30 
from the broadcast receiver 102 is input to the packing 
unit 112, it is divided into plural streams corresponding 
to data units (first data units) each having a predeter- 
mined data size, arid PCK data Dpck corresponding to 
each stream (pack) is output to the VOB composer 35 
103b. At this time, the audio stream is output as audio 
PCK data (audio pack), and the video stream is output 
as video PCK data (video pack). 

[0183] In the VOB composer 103, a predetermined 
number of audio packs and video packs equivalent to a 40 
predetermined display time are put together as a VOBU 
(third data unit) . Then, VOB data Dvob, which com- 
prises a plurality of VOBUs equivalentto the audio video 
stream Sav inputted from the start of recording to the 
end of recording, is generated. 45 
[0184] The VOB data Dvob is converted into a 
recording signal Sre by the recording signal processor 
104, and the recording signal Sre is recorded on the 
recording medium 106 by the optical head 105 accord- 
ing to the write address Iwa. so 
[0185] In the video resolution detector 107d, the 
video resolution is detected, and the video resolution 
data Dvr is output. In the address information generator 
108d, specific address information Isa indicating the 
write address at the time when the video resolution 55 
changes is generated on the basis of the video resolu- 
tion data Dvr and the write address information Iwa. 
[0186] The padding size generator 1 1 3 receives the 



video resolution data Dvr and the internal signal Sop 
indicating the packing state in the packing unit 112. 
When the video resolution changes, the padding size 
generator 113 calculates the size of padding data, and 
outputs information Dps (size information) indicating the 
data size. To be specific, when the resolution change is 
detected while the stream (video stream or audio 
stream) is being stored in a specific pack by the packing 
unit 112, the vacant space of the pack at this point of 
time is calculated as the size of padding data. 
[0187] When the data size information Dps is input 
to the packing unit 112, padding data is inserted in the 
vacant space of the pack, by an amount indicated by the 
size information Dps, instead of a portion of the stream 
after the position where the resolution change occurs. 
Therefore, even when the resolution change occurs dur- 
ing formation of the specific pack, the stream after the 
resolution change is not stored in this pack together with 
the stream before the resolution change. In other words, 
when the resolution change occurs, the stream before 
the resolution change and the stream after the resolu- 
tion change are stored in adjacent different packs, 
respectively. That is, the stream after the resolution 
change is stored, from its beginning, in a pack which fol- 
lows the specific pack. 

[0188] Thereafter, the recording apparatus 100e 
operates in the same way as described for the recording 
apparatus 100d of the fourth embodiment, and the 
recording signal Sre corresponding to the VMG informa- 
tion Dvmg of each VOB is recorded on the recording 
medium 106. 

[0189] As described above, the recording appara- 
tus 100e of this fifth embodiment is provided with, 
instead of the packing unit of the fourth embodiment, 
the packing unit 112 for inserting padding data in the 
pack under formation when the video resolution 
changes, and the padding size generator 1 13 for calcu- 
lating the size of the padding data. When the resolution 
changes, the padding data is inserted in the pack under 
formation, instead of the stream after the resolution 
change. Therefore, the stream is recorded such that the 
head of the stream after the resolution change is 
matched to the head of the pack. 

[0190] Accordingly, when playing the audio video 
stream recorded on the optical disk 1 06, the recording 
position corresponding to the portion where the resolu- 
tion changes, in the audio video stream whose record- 
ing addresses are managed in pack units, can be 
recognized by reading the VMG information 10a, and 
this enables quick access to the recording position of 
the audio video stream where the resolution change 
occurs on the optical disk. 

[0191] While in this fifth embodiment the predeter- 
mined position of the stream and the head position of 
the pack are matched to each other by inserting pad- 
ding data in the pack, the method of matching the pre- 
determined position of the stream to the head position 
of the pack is not restricted thereto. For example, the 
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size information of the pack data written in the pack 
header or the like may be changed. 

[Embodiment 6] 

[0192] Figure 6 is a block diagram for explaining a 
recording apparatus 100f according to a sixth embodi- 
ment of the present invention. 

[0193] This recording apparatus 100f includes a 
VMG generator 1 1 0f instead of the address information 
generator 108d and the VMG generator 1 1 0d of the 
fourth embodiment. The VMG generator 1 1 0f generates 
VMG information Dvmg which is management informa- 
tion for each VOB, on the basis of the video resolution 
data Dvrfrom the resolution detector 107d. 
[0194] The recording apparatus 100f further 
includes a VOB composer 1 03f instead of the VOB com- 
poser 103 of the fourth embodiment. When the video 
resolution changes, the VOB composer 103f generates 
VOB data so that the stream before the resolution 
change and the stream after the resolution change are 
separated as different VOB data, on the basis of the 
video resolution data Dvr. 

[0195] Other constituents of the recording appara- 
tus 100f are identical to those already described for the 
recording apparatus 100d of the fourth embodiment. 
[0196] Hereinafter, the operation of the recording 
apparatus 100f will be described. 

[01 97] In this fifth embodiment, the broadcast signal 
Sbr from the antenna 101 is received by the broadcast 
receiver 1 02, and the audio video stream Sav from the 
receiver 1 02 is input to the VOB composer 1 03f and the 
video resolution detector 107d. In the VOB composer 
103f, the audio video stream Sav is converted to VOB 
data Dvob (see figures 21(a)&(b) and 22(a)&(b)) 
according to the format of the DVD recording standard. 
This VOB data Dvob is converted to a recording signal 
Sre by the recording signal processor 104, and the 
recording signal Sre is written in the optical disk 106 by 
the recording head 105. 

[0198] Further, in the video resolution detector 
1 07d, the video resolution is detected on the basis of 
the audio video stream Sav, and the video resolution 
data Dva is output to the VOB composer 103f and the 
VMG generator 11 Of. 

[0199] The receiver 102, the VOB composer 103f, 
the recording signal processor 104, the recording head 
105, and the video resolution detector 107d operate in 
the same way as described for the fourth embodiment. 
[0200] In this sixth embodiment, when the video 
resolution changes during recording of the audio video 
stream Sav, the VOB composer 1 03f forms VOB data 
Dvob corresponding to the audio video stream Sav such 
that the stream before the resolution change and the 
stream after the resolution change are separated as dif- 
ferent VOB data. 

[0201 ] In the VMG generator 1 1 0f, VMG information 
Dvmg which is management information for recorded 



VOB data is generated, and the VMG information Dvmg 
is output to the recording signal processor 104. In the 
processor 104, the VMG information Dvmg is subjected 
to addition of error correcting code and to recording 
5 modulation, thereby generating a recording signal Sre 
corresponding to the VMG information Dvmg. The 
recording signal Sre is recorded on the optical disk 106 
by the optical head 1 05. 

[0202] As shown in figure 18, the VMG information 

10 Dvmg includes, as management information for each 
VOB, recording start address information, search infor- 
mation, video attribute information such as video resolu- 
tion information, and audio attribute information. 
[0203] As described above, the recording appara- 

15 tus 1 0Of of this sixth embodiment is provided with the 
VOB composer 1 03f which generates VOB data so that, 
when the video resolution changes, the stream before 
the resolution change and the stream after the resolu- 
tion change are separated as different VOB data. 

20 Therefore, the video resolution can be unified for one 
VOB data. Accordingly, when playing the signal 
recorded on the optical disk 1 06, decoding of each VOB 
can be performed on the basis of the video resolution 
which is managed in VOB units. 

25 [0204] Further, when playing the audio video 
stream recorded on the optical disk, reading the VMG 
information Dvmg enables quick access to the recording 
position in the stream where the resolution change 
occurs. 

30 

[Embodiment 7] 

[0205] Figure 7 is a block diagram for explaining a 
recording apparatus 100g according to a seventh 

35 embodiment of the present invention. 

[0206] The recording apparatus 100g includes an 
aspect ratio detector 107g and an address information 
generator 108g instead of the video resolution detector 
107d and the address information generator 108d 

40 according to the fourth embodiment, respectively. The 
aspect ratio detector 107g receives an audio video 
stream Sav, detects the aspect ratio of the video stream 
included in the audio vide stream Sav by analyzing the 
header information of the video stream, and outputs 

45 data Dar indicating the aspect ratio. The address infor- 
mation generator 1 08d generates specific address infor- 
mation Isa indicating the write address at the time when 
the aspect ratio changes, on the basis of the output Dar 
from the detector 107g and the write address informa- 

so tion Iwa from the system controller 111. The specific 
address information Isa and the aspect ratio data Dar 
are output to the VMG generator 110. Other constitu- 
ents of the recording apparatus 100g are identical to 
those of the recording apparatus 100d according to the 

55 fourth embodiment. 

[0207] The aspect ratio detector 107g detects, as 
the aspect ratio, a ratio such as 16:9 or 4:3, the type of 
letter box, or the like. 
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[0208] Next, the operation of the recording appara- 
tus lOOg will be described. 

[0209] In this seventh embodiment, as in the fourth 
embodiment, the broadcast signal Sbr is received by 
the broadcast receiver 1 02, and the audio video stream 5 
Sav outputted from the receiver 1 02 is input to the VOB 
composer 103 through the packing unit (not shown). 
Further, the VOB data Dvob from the VOP composer 
1 03 is converted to a recording signal Sre by the record- 
ing signal processor 1 04, and the recording signal Sre is 
written in the optical disk 106 by the recording head 
105. 

[0210] The receiver 102, the VOB composer 103, 
the packing unit (not shown), the VOB composer 103, 
the recording signal processor 104, and the recording 
head 1 05 operate in the same way as described for the 
fourth embodiment. 

[0211] In this seventh embodiment, in the aspect 
ratio detector 1 07g, the aspect ratio data Dar indicating 
the aspect ratio of the video frame is detected by ana- 
lyzing the header of the video stream included in the 
audio video stream Sav, and the aspect ratio data Dar is 
output to the address information generator 108g and 
the VMG generator 1 1 0d. 

[0212] Further, in the address information genera- 
tor 108g, when the aspect ratio changes, the specific 
address information Isa indicating the write address at 
the time when the aspect ratio changes, is output to the 
VMG generator 110d, on the basis of the output Sar 
from the detector 107g and the write address informa- 
tion Iwa from the controller 111. 

[0213] In the VMG generator 1 10d, the VMG infor- 
mation Dvmg that is management information for each 
VOB is generated on the basis of the output Dar from 
the aspect ratio detector 107g and the output Isa from 
the address information generator 108g. This VMG 
information Dvmg is subjected to addition of error cor- 
recting code and to recording modulation by the record- 
ing signal processor 104, thereby generating a 
recording signal Sre corresponding to the VMG informa- 
tion Dvmg. This recording signal Sre is written in the 
optica! disk 106 by the recording head 105. 
[0214] The recording signal Sre corresponding to 
the VMG information Dvmg is written in a predeter- 
mined recording area of the recording medium 1 06 after 
recording of the recording signal Sre corresponding to 
the VOB data Dvob is completed. At this time, the spe- 
cific address information Isa is written as search infor- 
mation in the recording area corresponding to the VMG I 
1 0b1 (see figure 19) . Further, the aspect ratio data Dar 
is recorded in the V_ATR 1 0dl in the VMG information 
1 0a (see figure 1 9) . 

[0215] As described above, the recording appara- 
tus 100g according to the seventh embodiment of the 
invention is provided with the aspect ratio detector 1 07g 
for detecting the aspect ratio data Dar of the video 
frame, and the address information generator 108g for 
generating the specific address information Isa which 



indicates the data write address at the time when the 
aspect ratio changes, on the basis of the aspect ratio 
data Dar from the detector 107g. The specific address 
information Isa is written in the recording area of the 
optical disk 1 06 corresponding to the VMGI 1 0b1 which 
is a component of the management information (VMG 
information) 10a of each VOB. Therefore, when playing 
the audio video stream recorded on the optical disk 1 06, 
the recording position in the audio video stream, at 
which the aspect ratio changes, can be recognized by 
reading the VMG information 10a, and this enables 
quick access to the position where the aspect ratio 
changes, in the stream recorded on the optical disk. 

[Embodiment 8] 

[0216] Figure 8 is a block diagram for explaining a 
recording apparatus 100h according to an eighth 
embodiment of the present invention. 
[0217] The recording apparatus 100h of this eighth 
embodiment includes an aspect ratio detector 107g and 
an address information generator 108g instead of the 
video resolution detector 107d and the address informa- 
tion generator 108d according to the fifth embodiment. 
The aspect ratio detector 1 07g receives an audio video 
stream Sav, detects the aspect ratio of the video stream 
included in the audio vide stream Sav by analyzing the 
header information of the video stream, and outputs 
data Dar indicating the aspect ratio. The address infor- 
mation generator 1 08d generates specific address infor- 
mation Isa indicating the write address at the time when 
the aspect ratio changes, on the basis of the output Dar 
from the detector 107g and the write address informa- 
tion Iwa from the system controller 111. The specific 
address information Isa and the aspect ratio data Dar 
are output to the VMG generator 110, and the aspect 
ratio data Dar is output to the packing unit 1 12 and the 
padding size generator 113. Other constituents of the 
recording apparatus 100g are identical to those of the 
recording apparatus lOOe according to the fifth embod- 
iment. 

[0218] The aspect ratio detector 1 07g detects, as 
the aspect ratio, a ratio such as 1 6:9 or 4:3, the type of 
letter box, or the like. 

[0219] Next, the operation of the recording appara- 
tus 100h will be described. 

[0220] In this eighth embodiment, the audio video 
stream Sav from the broadcast receiver 1 02 is input to 
the packing unit 112, wherein the stream Sva is divided 
into plural streams corresponding to data units (packs) 
each having a predetermined data size by the packing 
unit 112, and PCK data Dpck corresponding to each 
pack is output to the VOB composer 103b. In the VOB 
composer 103b, a predetermined number of PCK data 
Dpck equivalent to a predetermined display time are put 
together as a VOBU which is one data unit. Then, VOB 
data Dvob, which comprises a plurality of VOBUs equiv- 
alent to the audio video stream Sav inputted from the 
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start of recording to the end of recording, is generated. 
[0221] The VOB data Dvob is converted into a 
recording signal Sre by the recording signal processor 
104, and the recording signal Sre is recorded on the 
recording medium 106 by the optical head 105. 
[0222] In the aspect ratio detector 1 07g, the aspect 
ratio data Dar is detected, and the video resolution data 
Dvr is output to the address information generator 1 08g, 
the padding size generator 113, and the packing unit 
112. 

[0223] In the address information generator 108g, 
specific address information Isa indicating the write 
address at the time when the aspect ratio changes is 
generated on the basis of the aspect ratio data Dar and 
the write address information Iwa from the system con- 
troller 111. 

[0224] When the aspect ratio changes, the padding 
size generator 1 13 calculates the size of padding data 
on the basis of the internal signal Sop indicating the 
packing state of the packing unit 1 12, and outputs size 
information Dps indicating the data size. To be specific, 
when the aspect ratio change is detected on the basis of 
the aspect ratio data Dar while the stream is being 
stored in a specific pack, the vacant space of this pack 
at this point of time is calculated as the size of padding 
data. 

[0225] When the data size information Dps is input 
to the packing unit 1 12, padding data is inserted in the 
vacant space of the pack, by an amount indicated by the 
size information Dps, instead of a portion of the stream 
after the position where the aspect ratio changes. 
Therefore, even when the aspect ratio changes during 
formation of the specific pack, the stream after the 
aspect ratio change is not stored in this pack together 
with the stream before the aspect ratio change. In other 
words, when the aspect ratio changes, the stream 
before the aspect ratio change and the stream after the 
aspect ratio change are stored in adjacent different 
packs, respectively. That is, the stream after the aspect 
ratio change is stored, from its beginning, in a pack 
which follows the specific pack. 

[0225] Thereafter, the recording apparatus lOOh 
operates in the same way as described for the recording 
apparatus 100e of the fifth embodiment, and the record- 
ing signal Sre corresponding to the VMG information 
Dvmg of each VOB is recorded on the recording 
medium 1 06. 

[0227] The recording The recording signal Sre cor- 
responding to the VMG data Dvmg is written in a prede- 
termined recording area of the recording medium 106 
after recording of the recording signal Sre correspond- 
ing to the VOB data Dvob is completed. At this time, the 
specific address information Isa is written as search 
information in the recording area corresponding to the 
VMG I 10b1 (see figure 19). Further, the aspect ratio 
data Dar is recorded in the V_ATR 10d1 in the VMG 
information 10a (see figure 19). 

[0228] As described above, the recording appara- 



tus 100h of this eighth embodiment is provided with the 
packing unit 112 which divides the audio video stream 
Sav into data units each having a predetermined data 
size. When the aspect ratio changes, padding data is 
5 inserted in the pack under formation, instead of the 
stream after the aspect ratio change. Therefore, the 
stream is recorded such that the head of the stream 
after the aspect ratio change is matched to the head of 
the pack. 

10 [0229] Accordingly, when playing the audio video 
stream recorded on the optical disk 106, the recording 
position where the aspect ratio changes, in the audio 
video stream whose recording addresses are managed 
in pack units, can be recognized by reading the VMG 

15 information 10a, and this enables quick access to the 
position where the aspect ratio changes in the stream 
recorded on the optical disk. 

[0230] While in this eighth embodiment the prede- 
termined position of the stream and the head position of 

20 the pack are matched to each other by inserting pad- 
ding data in the pack, the method of matching the pre- 
determined position of the stream to the head position 
of the pack is not restricted thereto. For example, the 
size information of the pack data written in the pack 

25 header or the like may be changed. 

[Embodiment 9] 

[0231] Figure 9 is a block diagram for explaining a 
30 recording apparatus 1 0Oi according to a ninth embodi- 
ment of the present invention. 

[0232] The recording apparatus 100i includes an 
aspect ratio detector 107g instead of the video resolu- 
tion detector 107d according to the sixth embodiment. 

35 The aspect ratio detector 107g receives an audio video 
stream Sav, detects the aspect ratio of the video stream 
included in the audio vide stream Sav by analyzing the 
header information of the video stream, and outputs 
data Dar indicating the aspect ratio. The aspect ratio 

40 data Dar is output to the VOP composer 103f and the 
VMG generator 1 1 0f. Other constituents of the record- 
ing apparatus 1 001 are identical to those of the record- 
ing apparatus 100f according to the sixth embodiment. 
[0233] The aspect ratio detector 107g detects, as 

45 the aspect ratio, a ratio such as 16:9 or 4:3, the type of 
letter box, or the like. 

[0234] Hereinafter, the operation of the recording 
apparatus 1 0Oi will be described. 

[0235] In the recording apparatus 100i, the broad- 
so cast signal Sbr is received by the broadcast receiver 
1 02, and the audio video stream Sav outputted from the 
receiver 1 02 is input to the VOB composer 1 03f through 
the packing unit (not shown) . The VOP data Dvob from 
the VOP composer 103 is converted to a recording sig- 
55 nal Sre by the recording signal processor 104, and the 
recording signal Sre is written in the optical disk 1 06 by 
the recording head 105. 

[0236] The receiver 102, the packing unit (not 
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shown), the VOB composer 103f, the recording signal 
processor 104, and the recording head 105 operate in 
the same way as described for the sixth embodiment. 
[0237] In this ninth embodiment, in the aspect ratio 
detector 107g, the aspect ratio data Dar indicating the 5 
aspect ratio of the video frame is detected by analyzing 
the header of the video stream included in the audio 
video stream Sav, and the aspect ratio data Dar so 
detected is output to the VOP composer 103f and the 
VMG generator 101f. w 
[0238] In the VOB composer 1 03f, when the aspect 
ratio changes during recording of the audio video 
stream Sav, VOB data Dvob corresponding to the audio 
video stream Sav is formed so that the stream before 
the aspect ratio change and the stream after the aspect 15 
ratio change are separated as different VOB data. 
[0239] In the VMG generator 1 1 0f, VMG information 
Dvmg as management information for recorded VOB 
data is generated, and the VMG information Dvmg is 
output to the recording signal processor 104. In the 20 
processor 104, the VMG information Dvmg is subjected 
to addition of error correcting code and to recording 
modulation, thereby generating a recording signal Sre 
corresponding to the VMG information Dvmg. The 
recording signal Sre is recorded on the optical disk 106 25 
by the optical head 105. 

[0240] As shown in figure 18, the VMG information 
Dvmg includes, as management information for each 
VOB, recording start address information, search infor- 
mation video attribute information, and audio attribute 30 
information. 

[0241] As described above, the recording appara- 
tus 100i of this ninth embodiment is provided with the 
VOB composer 1 03f which generates VOB data so that, 
when the aspect ratio changes, the stream before the 35 
aspect ratio change and the stream after the aspect 
ratio change are separated as different VOB data, as a 
VOB composer for converting the audio video stream 
Sav into VOB data according to the format of the DVD 
recording standard. Therefore, the aspect ratio can be 40 
unified for one VOB data. Accordingly, when playing the 
signal recorded on the optica! disk 106, decoding of 
each VOB data can be performed on the basis of the 
aspect ratio which is managed in VOB units. 
[0242] Further, when playing the audio video 45 
stream, reading the VMG information Dvmg enables 
quick access to the recording position in the stream 
where the aspect ratio changes. 

[0243] While in the first to ninth embodiments a dig- 
ital broadcast signal is input to the recording apparatus, 50 
a broadcast signal to be input to the recording appara- 
tus is not restricted thereto, and it may be an analog TV 
signal. 

[0244] In this case, information relating to the video 
attribute, such as video resolution or aspect ratio, and 55 
information relating to the audio attribute are detected 
on the basis of information superposed in the vertical 
blanking period. 



[0245] Further, while in the first to ninth embodi- 
ments the output from the broadcast receiver 102 is 
recorded after converting it into data according to the 
DVD recording standard, means for performing com- 
pressive coding on the output of the receiver 102 may 
be provided and the output from this coding means may 
be recorded after converting it into data according to the 
DVD recording standard. 

[0246] Further, while in the first to ninth embodi- 
ments information relating to the recording address of 
the point where the video attribute (e.g., video resolu- 
tion or aspect ratio) or the audio attribute changes is 
recorded as the VMG information which is management 
information for each VOP, information relating to the 
attribute change point is not restricted to the address 
information. For example, the recording time at which 
the video or audio attribute changes, based on the 
recording start time, may be used as the VMG informa- 
tion. 

[0247] In this case, in order to make quick access to 
the recorded data corresponding to the predetermined 
recording time, information indicating a time map for 
associating the recording time with a recording address 
is required. According to the time map, the recording 
time is converted into the recording address, and 
recorded data corresponding to the obtained recording 
address is accessed. 

[Embodiment 10] 

[0248] Figure 10 is a block diagram for explaining a 
coding apparatus 1 0Oj according to a tenth embodiment 
of the present invention. 

[0249] The coding apparatus 1 0Oj is an apparatus 
for coding a video signal included in an audio video sig- 
nal on the basis of the video attribute and the audio 
attribute. 

[0250] To be specific, the coding apparatus 100j 
includes an intra-frame encoder 122a for subjecting a 
video signal Svi to intra-frame coding to generate first 
coded data Ev1, and an inter-frame encoder 122b for 
subjecting the video signal Svi to inter-frame coding to 
generate second coded data Ev2. 
[0251] The intra-frame encoder 122a subjects the 
video signal Svi of a frame to be processed to intra- 
frame coding using only data within this frame. For 
example, the encoder 122a performs I frame coding 
base on MPEG. The inter-frame encoder 122b subjects 
the video signal Svi corresponding to a target frame to 
be processed to inter-frame predictive coding with refer- 
ence to image data of an already processed frame other 
than the target frame. For example, the encoder 122b 
performs P or B frame coding based on MPEG. 
[0252] The coding apparatus 100j further includes 
an input-side switch 121 for supplying the video signal 
Svi to either the encoder 122a or the encoder 122b 
according to a control signal Sent, and an output-side 
switch 1 23 for selecting either the first coded data Evi or 
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the second coded data Ev2 according to the control sig- 
nal Sent, and outputting the selected coded data as a 
video stream Dstr. 

[0253] The input-side switch 121 has an input termi- 
nal 121 a to which the video signal Svi is applied, a first 5 
output terminal 121b for supplying the video signal Svi 
to the intra-frame encoder 122a, and a second output 
terminal 121c for supplying the video signal Svi to the 
inter-frame encoder 122b. The input-side switch 121 
connects the input terminal 121a to either the first out- 10 
put terminal 121b or the second output terminal 121c 
according to the switch control signal Sent. The output- 
side switch 123 has a first input terminal 123a to which 
the first coded date Evl is applied, a second input termi- 
nal 1 23b to which the second coded data Ev2 is applied, 15 
and an output terminal 123c for outputting the video 
stream Dstr. The output-side switch 123 connects either 
the first input terminal 1 23a or the second input terminal 
1 23b to the output terminal 1 23c according to the switch 
control signal Sent. 20 
[0254] The coding apparatus 1 0Oj further includes a 
video attribute detector 124 and an audio attribute 
detector 125. The video attribute detector 124 detects 
the video attribute on the basis of the video signal Svi 
included in the audio video signal, and outputs video 25 
attribute data Dva. The audio attribute detector 125 
detects the audio attribute on the basis of the audio sig- 
nal Sau included in the audio video signal, and outputs 
audio attribute data Daa. 

[0255] Further, the coding apparatus 100j includes 30 
a coding controller 126 and a prediction mode controller 
127. The coding controller 126 outputs the switch con- 
trol signal Sent to the respective switches 121 and 123 
on the basis of the video attribute data Dva and the 
audio attribute data Daa. The prediction mode controller 35 
127 outputs a coding control signal Pent for controlling 
inter-frame coding, to the inter-frame encoder 122b, on 
the basis of the video attribute data Dva and the audio 
attribute data Daa. 

[0256] With reference to figure 17, in the state 40 
where there is no change in the video attribute and the 
audio attribute, the coding controller 126 controls the 
input-side and output-side switches 121 and 123 so that 
intra-frame coding and inter-frame coding are per- 
formed cyclically, thereby forming a GOP comprising a 45 
predetermined number of frames. On the other hand, 
when the video attribute or the audio attribute changes, 
the coding controller 126 controls the input-side and 
output-side switches 121 and 123 so as to change the 
cycles of the intra-frame coding and the inter-frame cod- 50 
ing, thereby matching the head of the stream after the 
video or audio attribute change to the head of the 
stream constituting the GOP. 

[0257] In the state where there is no change in the 
video attribute and the audio attribute, the prediction 55 
mode controller 127 controls the inter-frame encoder 
122b so that B-frame coding is carried out by using, as 
reference frames, two frames positioned before and 



after the target B frame. On the other hand, when the 
video attribute or the audio attribute changes, the pre- 
diction mode controller 127 controls the inter-frame 
encoder 122b so that B-frame coding is carried out with- 
out using, as a reference frame, a frame included in an 
already processed GOP whose video attribute or audio 
attribute is different from that of a specific containing the 
target frame. The already processed GOP is a GOP 
which has been subjected to coding for each frame 
before coding of the specific GOP. 
[0258] In order to place the frame immediately 
before the video attribute change as the last frame in 
the GOR the types of frames constituting the GOP may 
be adjusted by continuously performing coding so that 
the last several frames in the GOP are P frames or I 
frames. 

[0259] Hereinafter, the operation of the coding 
apparatus 100j will be described. 
[0260] When an audio video signal is input to the 
coding apparatus 1 0Oj, a video signal Svi included in the 
audio video signal and corresponding to each frame is 
applied to either the intra-frame encoder 122a or the 
inter-frame encoder 122b through the input-side switch 
121. 

[0261] In the intra-frame encoder 122a, the video 
signal Svi of the target frame is subjected to intra-frame 
coding without referring to image data of another frame, 
thereby generating first coded data (intra-frame coded 
data) Ev1 . That is, in the intra-frame encoder 122a, the 
video signal Svi is independently coded using only the 
image data included in the target frame. To be specific, 
l-frame coding based on MPEG is carried out. On the 
other hand, in the inter-frame encoder 122b, the video 
signal Svi of the target frame is coded with reference to 
image data of another frame, thereby generating sec- 
ond coded data (inter-frame coded data) Ev2. That is, in 
the inter-frame encoder 1 22b, the video signal Svi of the 
target frame is subjected to inter-frame predictive cod- 
ing using image data of a predetermined frame as a ref- 
erence frame. To be specific, P or B frame coding based 
on MPEG is carried out. 

[0262] The coded data generated in each encoder 
is output as a video stream Dstr through the output-side 
switch 123a. 

[0263] In the video attribute detector 1 24, the video 
attribute of the video signal Svi included in the inputted 
audio video signal is detected, and video attribute data 
Dva is output to the coding controller 126 and the pre- 
diction mode controller 127. In the audio attribute detec- 
tor 125, the audio attribute of the audio signal Sau 
included in the inputted audio video signal is detected, 
and audio attribute data Daa is output to the coding con- 
troller 126 and the prediction mode controller 127. 
[0264] In the coding controller 126, a control signal 
Sent is output to the input-side and output side switches 
1 21 and 1 23 on the basis of the video attribute data Dva 
and the audio attribute data Daa, and the switches 121 
and 1 23 are controlled so that the video signal Svi of the 
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target frame is subjected to either intra-frame coding or 
inter-frame coding. 

[0265] For example, when there is no change in the 
video attribute and the audio attribute, the switches 121 
and 123 are controlled so that intra-frame coding and 5 
inter-frame coding are cyclically performed as shown in 
figure 17, whereby a GOP comprising a predetermined 
number of frames is constituted. On the other hand, 
when the video attribute or the audio attribute changes, 
the switches 121 and 123 are controlled so that the 10 
head of the stream after the attribute change is matched 
to the head of the stream constituting the GOP, regard- 
less of the cycles of intra-frame coding and inter-frame 
coding. 

[0266] In the prediction mode controller 127, when 15 
there is no change in the video attribute and the audio 
attribute, the inter-frame encoder 122b is controlled so 
that B-frame coding is carried out by using, as reference 
frames, two frames positioned before and after the tar- 
get B frame. On the other hand, when the video attribute 20 
or the audio attribute changes, the inter-frame encoder 
122b is controlled so that B-frame coding is carried out 
without using, as a reference frame, a frame included in 
an already processed GOP whose video attribute or 
audio attribute is different from that of a specific GOP 25 
including the target frame. 

[0267] As described above, the coding apparatus 
1 0Oj of this tenth embodiment is provided with the cod- 
ing controller 126 for deciding that either intra-frame 
coding or inter-frame coding is performed on the video 30 
signal Svi of each frame, on the basis of the video 
attribute data Dva and the audio attribute data Daa, and 
the prediction mode controller 127 for controlling the 
prediction mode of inter-frame coding on the basis of 
the attribute data Dva and Daa. When the video 35 
attribute or the audio attribute changes, the coding 
mode and the prediction mode are controlled so that the 
head of the stream after the attribute change is matched 
to the head of the stream constituting the GOP and, fur- 
ther, a frame included in a GOP whose attribute is differ- 40 
ent from that of the GOP including the target frame is 
not used as a reference frame. Therefore, a video signal 
whose video attribute (e.g., coding mode, video resolu- 
tion, or video aspect ratio) changes, can be coded 
according to the coding method based on MPEG. 45 
[0268] That is, in the coding apparatus 100j of this 
tenth embodiment, even when the video attribute or the 
audio attribute changes, since the stream of the first 
frame after the attribute change is matched to the 
stream of the head frame of the GOP, it is possible to so 
make access to the video or audio attribute change 
point in the video stream recorded on the recording 
medium, with the GOP as the access unit. 
[0269] Further, since a frame included in a GOP 
whose video or audio attribute is difference from that of 55 
the target frame is not used as a reference frame for the 
target frame, independent coding or decoding can be 
performed on each of the GOPs before and after the 



attribute change point and, therefore, coding and 
decoding can be performed without failure even when 
the video or audio attribute changes. 

[Embodiment 11] 

[0270] Figure 1 1 is a block diagram for explaining a 
coding apparatus 100k according to an eleventh 
embodiment of the present invention. 
[0271 ] This coding apparatus 1 00k includes a pack- 
ing unit 128 in addition to the constituents of the coding 
apparatus 1 0Oj according to the tenth embodiment. The 
packing unit 128 subject the video stream Dstr output- 
ted from the output-side switch 123 to packing on the 
basis of the video attribute data Dva and the audio 
attribute data Daa, and outputs the packed video 
stream Dpck. The packing process is to divide the video 
stream Dstr into streams corresponding to data units 
(packs) each having a predetermined size. 
[0272] In the state where there is no change in the 
video attribute and the audio attribute, the packing unit 
128 performs packing on the video stream Dstr so that 
the video stream Dstr is divided into streams corre- 
sponding to data units (packs) each having a predeter- 
mined size. On the other hand, when the video attribute 
or the audio attribute changes, the packing unit 128 per- 
forms packing on the video stream Dstr so that the head 
of the stream after the video or audio attribute change is 
matched to the head of the stream stored in the pack. 
[0273] More specifically, when an attribute change 
occurs in the state where the stream is being stored in a 
specific pack, the packing unit 128 inserts predeter- 
mined padding data in this pack instead of the stream 
after the attribute change. The method for storing only 
the streams of the same attribute in the pack when an 
attribute change occurs is not restricted to the above- 
mentioned method of inserting predetermined padding 
data instead of the stream after the attribute change. 
For example, when the attribute changes, the pack size 
in the header information of the pack may be changed to 
prevent the stream after the attribute change from being 
included in this pack, following the stream before the 
attribute change. 

[0274] Other constituents of the coding apparatus 
1 00k are identical to those of the coding apparatus 1 0Oj 
of the tenth embodiment. 

[0275] Next, the function and effect will be 
described. 

[0276] In the coding apparatus 1 00k of this eleventh 
embodiment, like the coding apparatus 1 0Oj of the tenth 
embodiment, when the video attribute or the audio 
attribute changes, coding of the video signal is per- 
formed so that the stream of the first frame after the 
attribute change is matched to the stream of the head 
frame of the GOP. 

[0277] Then, in the packing unit 128, the packing 
process of dividing the video stream into packs of a pre- 
determined size is performed so that the video or audio 
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attribute change point in the stream is matched to the 
head of the pack. To be specific, when the attribute 
change occurs while the stream is being stored in a spe- 
cific pack, predetermined padding data is inserted in 
this pack instead of the stream after the attribute 
change. 

[0278] As described above, the coding apparatus 
100k of this eleventh embodiment is provided with the 
packing unit 128 in addition to the constituents of the 
tenth embodiment, and when an attribute change 
occurs while the stream is being stored in a specific 
pack, this packing unit 128 inserts predetermined pad- 
ding data in this pack instead of the stream after the 
attribute change. Therefore, it is possible to make quick 
access to the change point of the video attribute, such 
as the video resolution, in the pack data Dpck which is 
recorded on the recording medium according to 
address management in pack units. 

[Embodiment 12] 

[0279] Figure 12 is a block diagram for explaining a 
coding apparatus 100m according to a twelfth embodi- 
ment of the present invention. 

[0280] The coding apparatus 100m includes a 
video resolution detector 129 instead of the video 
attribute detector 124 and the audio attribute detector 

125 of the coding apparatus 100j according to the tenth 
embodiment. The video resolution detector 129 detects 
the video resolution on the basis of the video signal Svi 
included in the audio video signal, and outputs video 
resolution data Dvr. Preferably, the video resolution 
detector 129 detects, as the video resolution, at least 
one of the horizontal frame size, vertical frame size, and 
frame frequency, like the video resolution detector 1 07d 
according to the fourth embodiment. However, the 
detector 129 may detect, as the video resolution, that 
the video signal is either an interlace signal or a pro- 
gressive signal. 

[0281] Further, the coding apparatus 100m 
includes a coding controller 126m and a prediction 
mode controller 127m instead of the coding controller 

126 and the prediction mode controller 127 of the cod- 
ing apparatus 1 0Oj according to the tenth embodiment. 
The coding controller 126m controls the respective 
switches 121 and 123 by using a switch control signal 
Sent on the basis of the video resolution data Dvr, and 
the prediction mode controller 127m controls the inter- 
frame encoder 122b by using a coding control signal 
Pent on the basis of the video resolution data Dvr. 
[0282] With reference to figure 17, in the state 
where there is no change in the video resolution, the 
coding controller 126m controls the switches 121 and 
123 so that intra-frame coding and inter-frame coding 
are performed cyclically. On the other hand, when the 
video resolution changes, the coding controller 126m 
controls the switches 121 and 123 so as to change the 
cycles of intra-frame coding and inter-frame coding, 



thereby matching the head of the stream after the reso- 
lution change to the head of the stream constituting the 
GOP. 

[0283] In the state where there is no change in the 
5 video resolution, the prediction mode controller 127m 
controls the inter-frame encoder 122b so that B-frame 
coding is carried out by using, as reference frames, two 
frames positioned before and after the target B frame. 
On the other hand, when the video resolution changes, 
10 the prediction mode controller 127m controls the inter- 
frame encoder 122b so that B-frame coding is carried 
out without using, as a reference frame, a frame 
included in an already processed GOP whose video 
resolution is different from that of a specific GOP includ- 
15 ing the target frame. 

[0284] Other constituents of the coding apparatus 
1 00m are identical to those of the coding apparatus 1 0Oj 
of the tenth embodiment. 

[0285] Next, the operation of the coding apparatus 

20 1 00m will be described. 

[0286] Figure 13 shows a TV signal (video signal) 
inputted to the coding apparatus 100m. 
[0287] In figure 13, the video signal up to a frame 
F(j-3) is a high-vision signal, and the subsequent video 

25 signal from a frame F(j-2) is a standard TV signal. That 
is, the frame F(j-2) is a frame corresponding to the posi- 
tion where the video resolution changes in the video sig- 
nal. In figure 13, F(j-5) and F(j-4) are frames 
corresponding to the high-vision signal, and F(j-1)~ 

30 FQ+9) are frames corresponding to the standard TV sig- 
nal. 

[0288] When the video signal shown in figure 13 is 
input to the coding apparatus 100m, the video signal is 
coded by either the intra-frame encoder 122a or the 
35 inter-frame encoder 122b according to the video resolu- 
tion and the coding mode, and a video stream Dstr 
obtained in the coding process is output. The coding 
mode is information indicating that each frame is to be 
subjected to any of l-frame coding, P-frame coding, and 
40 B-frame coding, on the basis of the video signal. 

[0289] Hereinafter, a brief description will be given 
of the operations of the video resolution detector 129, 
the coding controller 126m, and the prediction mode 
controller 127m when the video signal shown in figure 
45 1 3 is input to the coding apparatus 1 00m. 

[0290] The video resolution detector 129 detects 
the video resolution on the basis of the inputted video 
signal Svi, and outputs video resolution data Dvr to the 
coding controller 126m and the prediction mode control- 
so Ier127m. 

[0291] Then, the coding controller 126m controls 
the switches 121 and 1 23 so that one GOP is completed 
at the frame F(j-3) and a new GOP is constituted from 
the next frame F(j-2) . Thereby all the frames constitut- 
55 ing one GOP have the same video resolution. In other 
words, frames having different video resolutions are not 
mixed in one GOP. 

[0292] Further, the prediction mode controller 1 27m 



23 



45 



EP 1 081 967 A2 



46 



controls the inter-frame encoder 122b so that only the I 
frame F(j) is used as a reference frame when perform- 
ing inter-frame coding on the B frames F(j-2) and F(j-1 ) . 
[0293] As described above, the coding apparatus 
100m according to the twelfth embodiment is provided 
with the video resolution detector 129 for detecting the 
video resolution on the basis of the video signal, the 
coding controller 1 26m for deciding that the video signal 
Svi of each frame is to be subjected to either intra-f rame 
coding or inter-frame coding on the basis of the video 
resolution data Dvr, and the prediction made controller 
127 for controlling the prediction mode of inter-frame 
coding on the basis of the video resolution data Dvr. 
Thereby, the video signal is separated into two GOPs at 
the video resolution change point and, further, each 
frame in the GOP, whose head frame corresponds to 
the video resolution change point, can be coded without 
referring to a frame of the previous GOP. Therefore, a 
video signal whose video resolution changes can be 
coded according to the MPEG coding method. 
[0294] That is, in the coding apparatus 100m, even 
when the video attribute changes, since the stream of 
the first frame after the resolution change is matched to 
the stream of the head frame of the GOP, it is possible 
to make quick access to the video resolution change 
point in the video stream which is recorded on the 
recoding medium with each GOP as an access unit. 
[0295] Further, since a frame of a GOP whose 
video resolution is different from that of the target frame 
is not used as a reference frame for the target frame, 
independent coding or decoding can be performed on 
each of the GOPs before and after the video resolution 
change point, whereby coding and decoding can be 
performed without failure even when the video resolu- 
tion changes. 

[0296] While in this twelfth embodiment the head 
frame of the first GOP (specific GOP) after the video 
resolution change is a B frame, the first frame of the 
specific GOP may be an I frame. Also in this case, when 
performing inter-frame coding on the specific GOP, a 
frame of a GOP whose video resolution is different from 
that of the specific GOP is not used as a reference 
frame. 

[0297] Further, in order to match the frame immedi- 
ately before the video resolution change to the last 
frame in the GOP previous to the specific GOP, the type 
of each frame may be controlled by continuously per- 
forming coding in which the last some frames in the pre- 
vious GOP are P frames or I frames. 
[0298] Further, while in this twelfth embodiment 
coding is performed such that two B frames are placed 
between adjacent I frame and P frame and between 
adjacent two P frames, the number of B frames is not 
restricted thereto. For example, these B frames may be 
dispensed with. 



[Embodiment 13] 

[0299] Figure 14 is a block diagram for explaining a 
coding apparatus 100n according to a thirteenth 

5 embodiment of the present invention. 

[0300] This coding apparatus 1 0On includes a pack- 
ing unit 1 28n in addition to the constituents of the coding 
apparatus 100m according to the twelfth embodiment. 
The packing unit 128n subject the video stream Dstr 

w outputted from the output-side switch 1 23 to packing on 
the basis of the video resolution data Dvr, and outputs 
the packed video stream Dpck. The packing process is 
to divide the video stream Dstr into streams correspond- 
ing to data units (packs) each having a predetermined 

15 size. 

[0301] The packing unit 128n is identical to the 
packing unit 128 according to the eleventh embodiment 
except that the unit 128n receives the video resolution 
data Dvr while the unit 128 receives the video attribute 

20 data Dva and the audio attribute data Daa. 

[0302] That is, in the state where there is no change 
in the video resolution, the packing unit 128n performs 
packing on the video stream Dstr so that the video 
stream Dstr is divided into streams corresponding to 

25 data units (packs) each having a predetermined size. 
On the other hand, when the video resolution changes, 
the packing unit 128n performs packing on the video 
stream Dstr so that the head of the stream after the res- 
olution change is matched to the head of the stream 

30 stored in the pack. 

[0303] More specifically, when an attribute change 
occurs in the state where the stream is being stored in a 
specific pack, the packing unit 128n inserts predeter- 
mined padding data in this pack instead of the stream 

35 after the attribute change. 

[0304] Other constituents of the coding apparatus 
100n are identical to those of the coding apparatus 
100m according to the twelfth embodiment. 
[0305] Next, the function and effect will be 

40 described. 

[0306] In the coding apparatus 100n of this thir- 
teenth embodiment, like the coding apparatus 100m of 
the twelfth embodiment, when the video resolution 
changes, coding of the video signal is performed so that 

45 the stream of the first frame after the resolution change 
is matched to the stream of the head frame of the GOP. 
[0307] Then, in the packing unit 128n, the packing 
process of dividing the video stream into packs of a pre- 
determined size is performed so that the video resolu- 

50 tion change point in the stream is matched to the head 
of the pack. To be specific, when the attribute change 
occurs while the stream is being stored in a specific 
pack, predetermined padding data is inserted in this 
pack instead of the stream after the resolution change. 

55 [0308] As described above, the coding apparatus 
100n of this thirteenth embodiment is provided with the 
packing unit 128n in addition to the constituents of the 
twelfth embodiment, and when the video resolution 
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changes while the stream is being stored in a specific 
pack, this packing unit 128n inserts predetermined pad- 
ding data in this pack instead of the stream after the res- 
olution change. Therefore, it is possible to make quick 
access to the video resolution change point in the pack 
data Dpck which is recorded on the recording medium 
according to address management in pack units. 

[Embodiment 14] 

[0309] Figure 15 is a block diagram illustrating a 
coding apparatus 100p according to a fourteenth 
embodiment of the present invention. 
[0310] The coding apparatus 100p includes an 
aspect ratio detector 130 instead of the video attribute 
detector 124 and the audio attribute detector 125 of the 
coding apparatus 1 0Oj according to the tenth embodi- 
ment. The aspect ratio detector 130 detects the aspect 
ratio on the basis of the video signal Svi included in the 
audio video signal, and outputs aspect ratio data Dar. 
The aspect ratio detector 130 detects the aspect ratio 
by reading aspect ratio information which is embedded 
in the vertical blanking period of the video signal. Fur- 
ther, the aspect ratio detector 130 detects the value of 
the aspect ratio (1 6:9 or 4:3), the letter box information, 
or the like. 

[031 1 ] Further, the coding apparatus 1 0Op includes 
a coding controller 126p and a prediction mode control- 
ler 127p instead of the coding controller 126 and the 
prediction mode controller 127 of the coding apparatus 
1 0Oj according to the tenth embodiment. The coding 
controller 126p controls the respective switches 121 
and 123 by using a switch control signal Sent on the 
basis of the aspect ratio data Dar, and the prediction 
mode controller 127p controls the inter-frame encoder 
122b by using a coding control signal Pent on the basis 
of the aspect ratio data Dar. 

[0312] With reference to figure 17, in the state 
where there is no change in the aspect ratio, the coding 
controller 126p controls the switches 121 and 123 so 
that intra-frame coding and inter-frame coding are per- 
formed cyclically. On the other hand, when the aspect 
ratio changes, the coding controller 126p controls the 
switches 121 and 123 so as to change the cycles of 
intra-frame coding and inter-frame coding, thereby 
matching the head of the stream after the aspect ratio 
change to the head of the stream constituting the GOP. 
[0313] In the state where there is no change in the 
aspect ratio, the prediction mode controller 127p con- 
trols the inter-frame encoder 122b so that B-frame cod- 
ing is carried out by using, as reference frames, two 
frames positioned before and after the target B frame. 
On the other hand, when the aspect ratio changes, the 
prediction mode controller 127p controls the inter-frame 
encoder 1 22b so that B-frame coding is carried out with- 
out using, as a reference frame, a frame included in an 
already processed GOP whose aspect ratio is different 
from that of a specific GOP including the target frame. 



[0314] Other constituents of the coding apparatus 
1 0Op are identical to those of the coding apparatus 1 0Oj 
of the tenth embodiment. 

[0315] Next, the operation of the coding apparatus 

5 1 0Op will be described. 

[0316] When the video signal Svi is input to the cod- 
ing apparatus 100p, the video signal Svi is coded by 
either the intra-frame encoder 122a or the inter-frame 
encoder 122b according to the change of the aspect 

10 ratio, and a video stream Dstr obtained in the coding 
process is output. 

[0317] That is, the aspect ratio detector 130 detects 
the aspect ratio on the basis of the inputted video signal 
Svi, and outputs aspect ratio data Dar to the coding con- 

15 troller 126p and the prediction mode controller 127p. 
[0318] The coding controller 126p controls the 
switches 121 and 123 so that one GOP is completed at 
the frame just before the aspect ratio change and a new 
GOP is constituted from the next frame. Thereby, all the 

20 frames constituting one GOP have the same aspect 
ratio. In other words, frames having different aspect 
ratios are not mixed in one GOP. 

[031 9] The prediction mode controller 1 27p controls 
the inter-frame encoder 122b so that a frame of a GOP 
25 whose aspect ratio is different from that of the target 
frame is not used as a reference frame for the target 
frame. 

[0320] As described above, the coding apparatus 
100p according to the fourteenth embodiment is pro- 

30 vided with the aspect ratio detector 1 30 for detecting the 
aspect ratio on the basis of the video signal, the coding 
controller 126p for deciding that the video signal Svi of 
each frame is to be subjected to either intra-frame cod- 
ing or inter-frame coding on the basis of the aspect ratio 

35 data Dar, and the prediction mode controller 127p for 
controlling the prediction mode of inter-frame coding on 
the basis of the aspect ratio data Dar. Thereby, the 
video signal is separated into two GOPs at the aspect 
ratio change point and, further, each frame in the GOP, 

40 whose head frame corresponds to the aspect ratio 
change point, can be coded without referring to a frame 
in the previous GOP. Therefore, a video signal whose 
aspect ratio changes can be coded according to the 
MPEG coding method. 

45 [0321] That is, in the coding apparatus 100p, even 
when the aspect ratio changes, since the stream of the 
first frame after the aspect ratio change is matched to 
the stream of the head frame of the GOP, quick access 
can be made to the aspect ratio change point in the 

so video stream which is recorded on the recoding medium 
with each GOP as an access unit. 
[0322] Further, since a frame of a GOP whose 
aspect ratio is different from that of the target frame is 
not used as a reference frame for the target frame, inde- 

55 pendent coding or decoding can be performed on each 
of the GOPs before and after the aspect ratio change 
point. Therefore, coding and decoding can be per- 
formed without failure even when the aspect ratio 



25 



49 

changes. 

[0323] While in this fourteenth embodiment the 
head frame of the first GOP (specific GOP) after the 
aspect ratio change is a B frame, the first frame of the 
specific GOP may be an I frame. Also in this case, when 5 
performing inter-frame coding on the specific GOR a 
frame in a GOP whose aspect ratio is different from that 
of the specific GOP is not used as a reference frame. 
[0324] Further, in order to match the frame immedi- 
ately before the aspect ratio change to the last frame in 10 
the GOP previous to the specific GOP, the type of each 
frame may be controlled by continuously performing 
coding such that the last some frames in the previous 
GOP are P frames or I frames. 

[0325] Further, while in this fourteenth embodiment 15 
coding is performed such that two B frames are placed 
between adjacent I frame and P frame and between 
adjacent two P frames, the number of B frames is not 
restricted thereto. For example, these B frames may be 
dispensed with. 20 

[Embodiment 15] 

[0326] Figure 1 6 is a block diagram for explaining a 
coding apparatus 1 0Oq according to a fifteenth embodi- 25 
ment of the present invention. 

[0327] This coding apparatus 1 0Oq includes a pack- 
ing unit 1 28q in addition to the constituents of the coding 
apparatus 100p according to the fourteenth embodi- 
ment. The packing unit 128q subject the video stream 30 
Dstr outputted from the output-side switch 123 to pack- 
ing on the basis of the aspect ratio data Dar, and out- 
puts the packed video stream Dpck. The packing 
process is to divide the video stream Dstr into streams 
corresponding to data units (packs) each having a pre- 35 
determined size. 

[0328] The packing unit 128q is identical to the 
packing unit 128 according to the eleventh embodiment 
except that the unit 128q receives the aspect ratio data 
Dar while the unit 128 receives the video attribute data 40 
Dva and the audio attribute data Daa. 
[032S] That is, in the state where there is no change 
in the aspect ratio, the packing unit 128q performs pack- 
ing on the video stream Dstr so that the video stream 
Dstr is divided into streams corresponding to data units 45 
(packs) each having a predetermined size. On the other 
hand, when the aspect ratio changes, the packing unit 
128q performs packing on the video stream Dstr so that 
the head of the stream after the aspect ratio change is 
matched to the head of the stream stored in the pack. so 
[0330] More specifically, when the aspect ratio 
changes in the state where the stream is being stored in 
a specific pack, the packing unit 128q inserts predeter- 
mined padding data in this pack instead of the stream 
after the aspect ratio change. 55 
[0331] Other constituents of the coding apparatus 
1 0Oq are identical to those of the coding apparatus 1 00k 
according to the fourteenth embodiment. 
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[0332] Next, the function and effect will be 
described. 

[0333] In the coding apparatus 1 0Oq of this fifteenth 
embodiment, like the coding apparatus 100p of the four- 
teenth embodiment, when the aspect ratio changes, 
coding of the video signal is performed so that the 
stream of the first frame after the aspect ratio change is 
matched to the stream of the head frame of the GOP. 
[0334] Then, in the packing unit 128q, the packing 
process of dividing the video stream into packs of a pre- 
determined size is performed so that the aspect ratio 
change point in the stream is matched to the head of the 
pack. To be specific, when the aspect ratio changes 
while the stream is being stored in a specific pack, pre- 
determined padding data is inserted in this pack instead 
of the stream after the aspect ratio change. 
[0335] As described above, the coding apparatus 
100q of this fifteenth embodiment is provided with the 
packing unit 128q in addition to the constituents of the 
fourteenth embodiment, and when the aspect ratio 
changes while the stream is being stored in a specific 
pack, this packing unit 128q inserts predetermined pad- 
ding data in this pack instead of the stream after the 
aspect ratio change. Therefore, it is possible to make 
quick access to the aspect ratio change point in the 
pack data Dpck which is recorded on the recording 
medium according to address management in pack 
units. 

[0336] The recording apparatuses according to the 
first to ninth embodiments and the coding apparatuses 
according to the tenth to fifteenth embodiments can be 
implemented by software in a computer system, with 
the same effects as described above. 
[0337] Further, while in the first to ninth embodi- 
ment an optical disk is described as a recording 
medium, a recording medium on which an audio video 
stream is recorded is not restricted thereto. Any record- 
ing medium may be employed so long as it can store 
digital data. 

Claims 

1. A recording apparatus for recording, on a recording 
medium, an audio video stream which is obtained 
by coding an audio video signal including an audio 
signal and a video signal, said apparatus compris- 
ing: 

an attribute detector for detecting an attribute 
relating to at least one of the video signal and 
the audio signal on the basis of the audio video 
stream, and outputting attribute data indicating 
the attribute; 

an information generator for detecting a record- 
ing position in the audio video stream recorded 
on the recording medium, which recoding posi- 
tion corresponds to a point where the attribute 
changes, or a recording time of the audio video 
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stream based on a reference time, which 
recording time corresponds to the attribute 
change point, and outputting attribute change 
information indicating the recording position or 
the recording time; and 5 
a recorder for recording the attribute data and 
the attribute change information on the record- 
ing medium. 

2. The recording apparatus of Claim 1 , wherein: 10 

said attribute detector detects a video attribute 
relating to the video signal and an audio 
attribute relating to the audio signal, and out- 
puts attribute data indicating each attribute; 15 
and 

said recorder records the attribute data indicat- 
ing the video attribute and the attribute data 
indicating the audio attribute in predetermined 
recording areas of the recording medium, 20 
respectively. 

3. The recording apparatus of Claim 1 , wherein: 

said attribute detector detects the video resolu- 25 
tion of the video signal as an attribute relating 
to the video signal, and outputs video resolu- 
tion data indicating the video resolution; 
on the basis of the video resolution data, said 
information generator detects a recording posi- 30 
tion in the stream recorded on the recording 
medium, which recording position corresponds 
to a point where the video resolution changes, 
or a recording time of the stream based on a 
reference time, which recording time corre- 35 
sponds to the video resolution change point, 
and outputs attribute change information indi- 
cating the recording position or the recording 
time; and 

said recorder records the video resolution data 40 
and the resolution change information on the 
recording medium. 

4. The recording apparatus of Claim 1 , wherein: 

45 

said attribute detector detects the aspect ratio 
of the video signal as an attribute relating to the 
video signal, and outputs aspect ratio data indi- 
cating the aspect ratio; 

on the basis of the aspect ratio data, said infor- so 
mation generator detects a recording position 
in the stream recorded on the recording 
medium, which recording position corresponds 
to a point where the aspect ratio changes, or a 
recording time of the stream based on a refer- 55 
ence time, which recording time corresponds to 
the aspect ratio change point, and outputs 
aspect ratio change information indicating the 



recording position or the recording time; and 
said recorder records the aspect ratio data and 
the aspect ratio change information on the 
recording medium. 

5. A recording apparatus for recording, on a recording 
medium, an audio video stream which is obtained 
by coding an audio video signal including an audio 
signal and a video signal, said apparatus compris- 
ing: 

a packing unit for performing packing to divide 
the audio video stream into plural streams cor- 
responding to packs as data units each having 
a predetermined size, and outputting the audio 
video stream corresponding to each pack as 
pack data; 

a recorder for recording each pack data on the 
recording medium, as an access unit to the 
recording medium; 

an attribute detector for detecting an attribute 
relating to at least one of the video signal and 
the audio signal, and outputting attribute data 
indicating the attribute; and 
said packing unit performing the packing such 
that a position in the audio video stream, where 
the attribute changes, is positioned at the head 
of the pack. 

6. The recording apparatus of Claim 5, wherein: 

said attribute detector detects a video attribute 
relating to the video signal and an audio 
attribute relating to the audio signal on the 
basis of the audio video stream, and outputs 
video attribute data indicating the video 
attribute and audio attribute data indicating the 
audio attribute; and 

said recorder records the video attribute data 
and the audio attribute data in predetermined 
areas of the recording medium, respectively. 

7. The recording apparatus of Claim 5 further com- 
prising: 

an information generator for detecting a record- 
ing position in the audio video stream recorded 
on the recording medium, which recording 
position corresponds to a point where at least 
one of the audio attribute and the video 
attribute changes, or a recording time of the 
audio video stream based on a reference time, 
which recording time corresponds to the 
attribute change point, and outputting attribute 
change information indicating the recording 
position or the recording time; and 
said recorder records the attribute change 
information on the recording medium. 
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8. The recording apparatus of Claim 5, wherein: 



12. The recording apparatus of Claim 10, wherein: 



said attribute detector detects the video resolu- 
tion of the video signal as an attribute relating 
to the video signal, and outputs video resolu- 
tion data indicating the resolution; and 
said packing unit performs the packing, on the 
basis of the video resolution data, such that a 
position in the video stream, where the video 
resolution changes, is positioned at the head of 
the pack. 

9. The recording apparatus of Claim 5, wherein: 

said attribute detector detects the aspect ratio 
of the video signal as an attribute relating to the 
video signal, and outputs aspect ratio data indi- 
cating the aspect ratio; and 
said packing unit performs the packing, on the 
basis of the aspect ratio data, such that a posi- 
tion in the video stream, where the aspect ratio 
changes, is positioned at the head of the pack. 

10. A recording apparatus for recording, on a recording 
medium, an audio video stream which is obtained 
by coding an audio video signal including an audio 
signal and a video signal, said apparatus compris- 
ing: 

a video object composer for dividing the audio 
video stream into plural streams corresponding 
to management units for managing the audio 
video stream, and outputting the stream corre- 
sponding to each management unit as video 
object data; 

a recorder for recording management informa- 
tion for managing each video object data on the 
recording medium; 

an attribute detector for detecting an attribute 
relating to at least one of the video signal and 
the audio signal on the basis of the audio video 
stream, and outputting attribute data indicating 
the attribute; and 

when the attribute changes, said video object 
composer dividing the audio video stream, on 
the basis of the attribute data, such that a por- 
tion of the audio video stream before the 
attribute change point and a portion of the 
audio video stream after the attribute change 
point are output as different video object data. 

11. The recording apparatus of Claim 10, wherein said 
management information includes information 
relating to the recording position of each video 
object data on the recording medium, or the record- 
ing time of each video object data based on a refer- 
ence time. 



said attribute detector detects a video attribute 
relating to the video signal and an audio 

5 attribute relating to the audio signal on the 

basis of the audio video stream, and outputs 
video attribute data indicating the video 
attribute and audio attribute data indicating the 
audio attribute; and 

10 said management information includes video 

attribute information indicating the video 
attribute and audio attribute information indicat- 
ing the audio attribute. 

15 13. The recording apparatus of Claim 10, wherein: 



said attribute detector detects the video resolu- 
tion of the video signal as an attribute relating 
to the video signal, and outputs video resolu- 
tion data indicating the resolution; and 
when the video resolution changes, said video 
object composer divides the video stream, on 
the basis of the video resolution data, such that 
a portion of the video stream before the video 
resolution change point and a portion of the 
video stream after the video resolution change 
point are output as different video object data. 
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14. The recording apparatus of Claim 10, wherein: 

said attribute detector detects the aspect ratio 
of the video signal as an attribute relating to the 
video signal, and outputs aspect ratio data indi- 
cating the aspect ratio; and 
when the aspect ration changes, said video 
object composer divides the video stream, on 
the basis of the aspect ratio data, such that a 
portion of the video stream before the aspect 
ratio change point and a portion of the video 
stream after the aspect ratio change point are 
output as different video object data. 

15. A coding apparatus for coding a video signal, com- 
prising: 

a video encoder for subjecting the video signal 
to intra-frame coding or inter-frame coding 
such that a group of frames including at least 
one frame subjected to the intra-frame coding 
is generated, and a video stream correspond- 
ing to the frame group is output as a stream 
unit that can be accessed randomly; 
a video attribute detector for detecting a video 
attribute of the video signal, and outputting 
video attribute data indicating the video 
attribute; and 

said video encoder forming the frame group 
such that a specific frame, whose video 
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attribute is different from that of a frame posi- 
tioned just before it, is stored as a head frame 
in the frame group. 

16. The coding apparatus of Claim 15 further compris- 
ing: 

an audio attribute detector for detecting an 
audio attribute of an audio signal appended to 
the video signal, and outputting audio attribute 
data indicating the audio attribute; and 
said video encoder forming the frame group 
such that a specific frame, whose video or 
audio attribute is different from that of a frame 
positioned just before it, is stored as a head 
frame in the frame group. 

17. The coding apparatus of claim 16 further compris- 
ing: 

a packing unit for performing packing to divide 
the video stream into plural streams corre- 
sponding to packs as data units each having a 
predetermined size, and outputting the stream 
corresponding to each pack as pack data; and 
said packing unit performing the packing such 
that a position in the video stream, where the 
video or audio attribute changes, is positioned 
at the head of the pack. 

18. The coding apparatus of Claim 15, wherein said 
video encoder performs coding on each frame in a 
specific frame group including the specific frame, 
without referring to the video signal corresponding 
to frames in a frame group which has been coded 
previously to the specific frame group. 
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corresponding to each pack as pack data; and 
said packing unit performing the packing such 
that a position in the video stream, where the 
video resolution changes, is positioned at the 
head of the pack. 

21. The coding apparatus of Claim 15 wherein: 

said attribute detector detects the aspect ratio 
of the video signal as the video attribute on the 
basis of the video signal, and outputs aspect 
ratio data indicating the aspect ratio; and 
said video encoder forms the frame group, on 
the basis of the aspect ratio data, such that a 
specific frame, whose aspect ratio is different 
from that of a frame positioned just before it, is 
stored as a head frame in the frame group. 

22. The coding apparatus of claim 21 further compris- 
ing: 

a packing unit for performing packing to divide 
the video stream into plural streams corre- 
sponding to packs as data units each having a 
predetermined size, and outputting the stream 
corresponding to each pack as pack data; and 
said packing unit performing the packing such 
that a position in the video stream, where the 
aspect ratio changes, is positioned at the head 
of the pack. 



19. The coding apparatus of Claim 15, wherein: 



said video attribute detector detects the video 40 
resolution of the video signal as a video 
attribute on the basis of the video signal, and 
outputs video resolution data indicating the 
video resolution; and 

said video encoder forms the frame group on 45 
the basis of the video resolution data, such that 
a specific frame, whose video resolution is dif- 
ferent from that of a frame positioned just 
before it, is stored as a head frame in the frame 
group. so 

20. The coding apparatus of Claim 19 further compris- 
ing: 

a packing unit for performing packing to divide 55 
the video stream into plural streams corre- 
sponding to packs as data units each having a 
predetermined size, and outputting the stream 
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